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INCREASING TOE TREHALOSE CONTENT OP ORGANISMS BY TRANSFORMING 
THEM WITH COMBINATIONS OF THE STRUCTURAL GENES FOR TREHALOSE 
SYNTHASE. 

FIELD OF INVENTION 

This invention relates to the genetic engineering of the 
trehalose synthetic pathway of yeasts , such as baker 's and 
distiller's yeasts, and to the transfer of this pathway by 
genetic engineering to other organisms. In particular, the 
present invention concerns trehalose synthase, novel genes 
encoding the trehalose synthase, novel vectors containing the 
novel genes, and host cells and organisms transformed with the 
novel vectors. The invention also relates to the production of 
trehalose and ethanol and to the improvement of the stress 
resistance of organisms, in particular yeasts and crop plants. 

BACKGROUND OF THE INVENTION: 

It is well known that sugars and other polyhydric compounds 
stabilize isolated proteins and phospholipid membranes during 
dehydration, probably by replacing the water molecules that are 
hydrogen-bonded to these macromolecules [reviewed* by Crowe, 
J.H. et al. (1987) Biochemical Journal 242, 1-10]. Trehalose 
(a-glucopyranosyl-a-D-glucopyranose) is a dimer of two glucose 
molecules linked through their reducing groups. Because it has 
no reducing groups, it does not take part in the Halliard 
reactions that cause many sugars to damage proteins, and it is 
one of the most effective known protectants of proteins and 
biological membranes in vitro . 

In nature, trehalose is found at high concentrations in yeasts 
and other fungi, some bacteria, insects, and some literal 
animals, such as the brine shrimp* It is notable that all thes 
organisms are frequently exposed to osmotic and dehydration 
stress. Accumulation of trehalose in higher plants is rare, but 
high levels occur in the so-called resurrection plants, such as 
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varying the growth conditions or administ ring h at shocks, 
high positive correlations have been found betwe n the 
trehalose cont nt o£ the cells and their r sistance to 
dehydration [Gadd, G. et al . (1987) Federation of European 
Microbiological Societies Microbiological Letters 48, 249-254], 
heat stress [Hottiger, T» et al., (1987) Federation of European 
Biochemical Societies Letters 220, 113-115] and freezing 
[G61inas, P. et al . Applied and Environmental Microbiology 55, 
2453-2459]. Also, strains of S. cerevisiae and other yeasts 
selected for resistance to osmotic stress [D'Amore, T. et al. 
(1991) Journal of Industrial Microbiology 7, 191-196] or high 
performance in frozen dough fermentation [Oda, Y. (1986) 
Applied and Environmental Microbiology 52, 941-943] were found 
to have 'unusually high trehalose contents. Furthermore, a 
mutation in the cyclic AMP signaling system of S. cerevisiae 
that causes constitutive high trehalose levels also causes 
constitutive thermotolerance, whereas another mutation in this 
system that prevents the usual rise in trehalose during heat 
shock also prevents the acquisition of thermotolerance 
[Hottiger, T. et al. . (1989) Federation of European Biochemical 
Societies Letters 255, 431-434] . Thus, there is much evidence 
pointing to a connection between trehalose content and. stress 
resistance in yeasts, especially S. cerevisiae . Similar 
findings have been made for several other fungi [e.g., Neves, 
M.J., Jorge, J. A. , Francois, J.M. & Terenzl, H.F. (1991) 
Federation of European Biochemical Societies Letters 283, 
19-22]. However, a causative relationship has not yet been 
demonstrated. Further, nearly all conditions that cause 
increases in the trehalose content of yeast also cause 
increases in the levels of the so-called heat shock proteins. 
The 1989 publication of Hottiger and colleagues, 
cited above, claims that canavanine does not cause an increase 
in either trehalose levels or thermotolerance, whereas this 
compound is reported to induce heat shock proteins. 

Whether or not ther is a causal relation b tween trehalose 
content and str ss resistance, it has become general practice 
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in the manufacture of baker's yeast to maximise the trehalose 
content of the yeast. Various maturation processes have been 
develop d to achieve this aim, and they ar of crucial 
importanc in the manufactxire of active dried yeast. The 
5 details of these processes are often secret, but they are 

generally empirical regimes in which carbon and nitrogen feeds, 
aeration and temperature are carefully controlled and selected 
strains of yeast are used. They demand time and energy inputs 
during which little increase in cell number occurs. A more 
10 rational and controlled process would be of economic benefit. 

According to Cabib f E. & Leloir, L.F. [(1957) Journal of 
Biological Chemistry 231 , 259-275] r trehalose is synthesized in 
yeast from uridine diphosphoglucose (DDPG) and glucose- 

15 -6-phosphate (G6P) by the sequential action of two enzyme 
activities, trehalose-6-phosphate synthase and trehalose- 
's -phosphate phosphatase. Londesborough, J.St Vuorio, 0. [(1991) 
Journal of Microbiology 137, 323-330, expressly incorporated 
herein by reference] have purified from baker's yeast a 

20 proteolytically modified protein complex that exhibited both 
these activities and appeared to contain a short polypeptide 
chain (57. kDa) and two truncated versions (86 JcDa and 93 kDa) 
of a long polypeptide chain. The intact long chain was 
estimated to have a mass of at least 115 kDa. It was not 

25 disclosed which enzyme activity or activities was associated 
with which polypeptide, nor indeed whether both polypeptide 
were essential for either or both enzymatic activities. 
Anti-sera against both polypeptides were reported, but no amino 
acid sequences were disclosed. 

30 

An earlier patent application [EP 451 896; see Claim 1] has 
claimed for a transformed yeast "comprising. . . . one gene 
encoding. . . .trehalose-6-phosphate synthase" . However, no 
information about the either the gene* or the protein it encodes 
35 was provided. 



Several authors have reported increases in TPS activity in 



conditions that 1 ad to accumulation of trehalose by S. 
cerevisiae , and Schizosaccharomyces porobe both during th 
approach to stationary phase [Winkler, K., et al . (1991) 
Federation of European Biochemical Soci ties Lett rs 291, 269- 
272; Francois, J., et al, (1991) Yeast 7, 575-587] and after 
temperature shift-ups to about 40 °C [De Virgilio, C, et al. 

(1990) Federation of European Biochemical, Societies Letters 
273, 107-110]. Panek and her colleagues [Panek, A.C, et ^1. 
(1987) Current Genetics 11, 459-465] have claimed that TPS 
activity is increased by dephosphorylation of pre-existing 
enzyme molecules, i.e., that it is the result of 
post-translational regulation. This claim has been challenged 
[Vandercammen, A., et al. f (1989) European Journal of 
Biochemistry 182, 613-620] but continues to be made [Panek, 

A.D. & Panek, A.C. (1990) Journal of Biotechnology 14, 

« 

229-238]. Evidence for or against an increase in the amount of 
enzyme during trehalose accumulation is conflicting* Inhibitors 
of mRNA synthesis inhibited trehalose accumulation by s. 
cerevisiae shifted from 30 to 45 °C [Attfield (1987) loc.cit.], 
whereas under very similar conditions Winkler et al [(1991) 
loc.cit.] found that cycloheximide (an inhibitor of protein 
synthesis) did not prevent the accumulation of trehalose, 
which, however, occurred without an observable increase in TPS 
activity. In a lower temperature range (a shift from 23 to 36 
°C) , trehalose accumulation was accompanied by a three- fold 
increase in TPS activity, and cycloheximide prevented the 
increase in TPS [Panek, A.C. , et al. (1990) Biochemie 72, 
77-79]* In Schizosaccharomyces oombe . [De Virgilio, C, et al, 

(1991) loc. cit.] temperature shiftup caused a large 
accumulation of trehalose and increase of TPS which were not 
prevented by cycloheximide, leading the authors to suggest that 
in this yeast a post-translational activation occurs. We now 
disclose that in S. cerevisiae the co-ordinate increases in TPS 
and TPP activities during exhaustion of glucose are accompanied 
by an increase in antig nic material recognized by anti-sera to 
the short and long chains of a purified trehalose synthase. 
Hence, a method to increase the trehalose content of cells, and 
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so r their stress tolerance, would be to isolate, clone, and 
modify the structural gen s (hereinafter referred to as TSS1, 
TSL1, and TSL2) of th se polypeptides and cause their 
expression in yeast or other host cells under the control of 
5 suitable promoters. If the expression of these genes could be 
controlled, then so could the trehalose content of the host 
cells. 



The well known metabolic theory of Kacser & Burns [(1973) 

10 Symposium of the Society of Experimental Biology 27, 65-107] 
teaches that in principle the concentration of any 
intermediate, such as trehalose, can be increased by increasing 
the amount of any enzyme synthesizing it or decreasing the 
amount of any enzyme degrading it, but that the size of the 

15 increase may not be significant. The novelty of the present 
invention* lies in the identification and characterization of 
the particular yeast genes that must be modified to increase 
the amounts of trehalose synthase and the recognition of the 
advantages of modifying the synthetic pathway rather than the 

20 degradative pathway. These advantages include (i) leaving the 
highly regulated [see, e.g., Thevelein, J^M. (1988) 
Experimental Mycology 12, 1-12) degradative pathway intact to 
avoid the physiological problems likely in yeast that cannot 
activate this pathway, (ii) the possibility of causing yeast to 

25 synthesize trehalose under physiological conditions where wild 
type yeasts do not (so that blocking the degradative pathway 
cannot increase the amount of trehalose) and (iii) the 
important possibility of introducing by these genes a 
trehalose-synthetiq capacity to organisms, such as most higher 

30 plants, that naturally lack this capacity. j 

$ 

Expression of the genes for trehalose synthesis in yeast under j 
conditions where trehalase is active will increase the 
operation of a so-called "futile" cycle, in which glucose is 
35 continuously phosphorylated, converted to trehalos and 
regenerated by hydrolysis of the trehalose, resulting in 
increased consumption of ATP. This ATP must be regenerated, and 
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of diagnostic kits, viruses and other protein material [WO 
87/00196] and, potentially r as a sourc of anti-tumour agents 
[Ohtsuro et al. (1991) Immunology 74 , 497-503). Trehalose for 
internal applications in humans would be much more saf ly 
obtained from yeast than from a genetically engineered 
bacterium. 

Second f by transferring these genes to higher plants after 
making suitable modifications obvious to anyone skilled in the 
art (in general, replacements of adenine/ thymine base pairs by 
guanine/ cytosine base pairs as suggested by Perlak et al. 
[(1991) Proceedings of the National Academy of Sciences of the 
U.S. A, 88, 3324-3328] and the introduction of suitable 
promoters, some of which may be* tissue-specific, to direct the 
synthesis of trehalose to frost and drought-sensitive tissues), 
the resistance of the plants to various stresses , especially 
frost and dehydration , should be improved. The economic 
importance of such improvements is potentially enormous, 
because even small increases in cold-tolerance will lead to 
large increases in growing season, whereas dehydration 
resistance can save entire crops in time of drought. Frost and 
drought resistance in higher plants is usually accompanied by 
increases in compounds such as proline rather than trehalose 
[reviewed by Stewart (1989) in "Plants under Stress 1 *, pp 
115-1301, but, as mentioned above, resurrection plants 
accumulate large amounts of trehalose and there seems, a 
priori * to be no reason why this strategy should not be 
successful. Therefore, the present invention includes a process 
to transform crop plants by introducing recombinant forms of 
the structural genes of yeast trehalose synthase (TSSl, TSL1 
and TSL2) so as to increase the trehalose content of some of 
their tissues compared to those of the parent plant. Such 
transformed plants can also be economic and safe sources of 
trehalose. Third, the shelf-life of food products can be 
increased by adding trehalose to them [WO 89/00012]. A further 
aspect of the present invention is a novel process for 
producing trehalose-enriched food products from plants by 
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causing them to xpress the structural genes for yeast • 
trehalose synthase in their edible tissues. 

SUMMARY OF INVENTION: 

5 

The present invention provides two isolated genes encoding, 
respectively a short and a long chain of yeast trehalose 
synthase and a third gene encoding a 99 kDa polypeptide that 
occurs in some trehalose synthase preparations and has 

10 trehalose-6-phosphatase activity. These genes can be used to 

transform an organism (such as a yeast, other fungus or higher 
eukaryote) , whereby the transformed organism produces more 
trehalose synthase resulting in a trehalose content higher than 
that of the parent organism * The higher trehalose content 

15 confers improved stress resistance and storage properties on 
the transformed organism as compared to the parent organism, 
and the transformed organism can be used to provide large 
quantities of trehalose. Thus, a process for producing a crop 
plant which has increased resistance to water deprivation, heat 

20 ' and cold, comprises transforming the plant by introducing at 
least one of the novel genes into the plant's tissue. 

The invention also provides a trehalose synthase which exhibits 
trehalose-6-phosphate synthase activity activatable by 
25 fructose-6-phosphate and also trehalose-6-phosphatase activity. 

Finally, the invention provides processes for producing 
trehalose by cultivating a host or an organism which has been 
transformed with at least one of the novel genes, and processes 
30 for producing trehalose enriched food products from plants by 
introducing at least one of the novel genes and allowing said 

o 

genes to express the trehalose synthase in the edible tissues 
of the plant. 
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BRIEF DESCRIPTION OF FIGURES 

Fig. 1. SDS-PAGE of intact: trehalo se synthase 

5 A 6-13 %T gradient gel was used. Lane 1 contains 8.3 /ig of 
intact trehalose synthase eluted from the UDPG-Glucuronate- 
Agarose column with 0.2 M NaCl (#11 of Table 1) . Lanes 2, 3 and 
4 contain, respectively, 7.7, 12 and 1.0 /xg of enzyme eluted 
from the column with 0.4 M NaCl containing 10 mM UDPG (#13 , #14 

10 and #15 from Table 1) . Lane 5 contains about 1 /xg each of 
molecular mass markers (myosin f B-galactosidase r 
a-phosphorylase, BSA, ovalbumin, lactate dehydrogenase, 
triosephosphate-isomerase, myoglobin and cytochrome c) . The 
major polypeptides of intact trehalose synthase are named on 

15 the left and the molecular mass calibration, in kDa, is shown 
on the right. 

Fig. 2. SDS-PAGE of immunoprecipitates of wild-type veast grown 
on YP/2 % glucose 

20 

A 9 %T gel was used. Lane 1 contains about 1 /xg each of the 
molecular mass markers used in Fig 1. Lanes 2, 3 and 4 contain 
immunoprecipitates from 3.8 mg fresh yeast harvested after 16.1 
h (1.2 % residual glucose}, 18.1 h (no residual glucose) and 39 
25 h. The molecular mass calibration is shown on the left .and the 
major polypeptides of trehalose synthase and the heavy chain of 
y-globulin are shown on the right. 

Fig. 3. The promoter and terminator of TSS1 and the amino acid 
30 sequence deduced from its ORF. 

(a) In the promoter and terminator regions, the start ATG and 
tandem TGA stop codons are double underlined and a TATA box and 
putative catabolite repression element are underlined, (b) In 
35 the amino acid sequence (SEQ ID NO: 2), the sequences found in 
peptides isolated from the short chain of tr halose synthase 
eure underlined. 
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Fig. 4. The promoter and t erminator regions of TSL1 and the 
amino acid sequenc e deduced from its ORF. 
(a) In the promoter region , the start ATG codon is doubl 
underlined and two TATA boxes and six putative heat shock 
5 elem nts ar und r lined, A putativ MIG1 binding site is 
over lined. In the terminator region, the TAA stop codon is 
double underlined and a putative transcription termination 
element is underlined. Lower case letters show the end of the 
terminator region of the ARGRII gene, which has opposite 
10 polarity, (b) In the amino acid sequence (SEQ ID NO: 82), 

sequences found in peptides isolated from (fragments of) the 
123 kDa long chain are underlined, and those from peptides 
liberated from intact trehalose synthase by limited digestion 
with trypsin are underlined and bold. 

15 

Fig. 5. Alignment of the amino acid sequences of the short and 
loner chains of trehalose synthase 

The complete short chain sequence (SEQ ID NO: 2; the upper 
20 sequence) is aligned against residues 320 to 814 of the 123 kDa 
long chain (SEQ ID NO: 4; the lower sequence). 32 gaps are 
introduced to optimize the alignment. Vertical dashes indicate 
identical residues. Colons indicate conservative substitutions* 

25 Fig. 6. Important restriction sites in TSS1 and TSL1 

The heavy lines indicate open reading frames. The scale bar 
shows one kb. 

30 Fig. 7. Synthesis of r u C1 -trehalose from ru- 14 C1 -glucose 
6-phosohate bv an extract of wild-type veast 

Reaction mixtures (100 /il) contained 40 mM HEPES/KOH pH 6.8, 1 
mg BSA/ml, 10 mM MgCl 2 10 mM [U- 14 C]-G6P (736 c.p.m./nmol) and 
35 (a) no phosphate or (b) 5 mM K phosphate pH 6,8 and (0) 5 mM 
UDPG, (•) 2.5 slH ADPG or (O) neither UDPG nor ADPG. Reactions 
were start d by adding 10 fil ( quivalent to 94 fig fresh yeast) 
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of a 28,000 g supernatant of stationary phase X2180. Reactions 
were stopped by transfer to boiling water for 2 min and 
addition of 1.0 ml of a slurry of AG1-X8 (format ) anion 
exchange resin [Lond sborough & Vuorio (1991) loc cit.]- The 
5 radioactivity in the resin supernatant was measured. 

Fig. 8. Western analysis of KIct 102 and X2180 veasts 

Growth of - the yeasts is described in Example 7 . The loads of 
10 fresh yeast per lane were: lane 1, 200 fig X2180/2; lanes 2 and 
5, 330 fig 2669/1: lanes 3 and 6 r 610 fig 2669/2; lanes 4 and 7, 
810 fig 2670/1+2; lane 8, 560 fig X2180/1 and lane 9, 280 fig 
X2180/1. The blot was probed with anti-TPS/P serum at a 
dilution of 1/30,000, Major bands of trehalose synthase are 
15 identified on the right. 

Fig. 9. Treatment of truncated trehalose synthase with 1.9 mM 
HM 

20 Truncated enzyme (0.13 TPS units/ml » 43 pqf ml) in 2 mg BSA/ml 
50 mM HEPES pH 7.0 containing 67 mM NaCl, 0.2 mM EDTA, 0.17 mM 
dithiothreitol, 0.17 mM benzamidine and 1.7 mM UDPG was 
incubated at 24 °C with (closed symbols) or without (open 
symbols) 1.9 mM NEM. TPS (t f O] and TPP (M r D) activities were 

25 measured, - 

Fig. 10. Autoradiooram of truncated trehalose synthase labelled 
with r 14 C1-NEM and separated bv SDS-PAGE 

30 Labelling was performed as described in Example 8 for 1~5, 

10. 5 , 63 and 190 min in lanes l f 2, 3 and 4, respectively. The 
positions of the (57 kDa) short chain, 93 and 86 kDa long chain 
fragments and the carrier BSA are indicated. 
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Fig. 11. Treatment of truncated tre halose synthase with ethvl- 
labelled NEM . 

Truncat d enzym (7.2 TPS units/ml « 0.24 mg/ml) in 1 mg BSA/ml 
5 25 mM HEPES pH 7.0 containing 2 mM MgC12, 1 mM EDTA and 0.2 M 
NaCl was incubated at 23 °C with (solid symbols) or without 
(open symbols) 32 /iM ethyl-labelled NEM. TPS (t,0] and TPP 
(■,□) activities and the amounts of [ 14 C]-NEM incorporated into 
the 93 (a) , 86 (+) and 57 (X) kDa polypeptides were measured. 
10 0*1 mol NEM incorporated per mol (150 Kg) of enzyme corresponds 
to an excess radioactivity of 75 c.p.m. in bands cut from the 
gel. 
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Fig. 12. Stoichiometrv of NEM labelling 

Residual TPP activity is plotted against the amount of NEM 
incorporated to the 93 and 86 kDa fragments of the long chain. 
Ring-labelled (•) and ethyl-labelled (0) NEM were used. 

20 Fig. 13. SPS-PA<?E qpalYsjg <?f fractigns ejutefl f yo^ the 
cellulose-phosphate with buffer containing 0.3 % Triton 

Lane L contains 47 ,p,l of the intact trehalose synthase applied 
to the column. Lane M contains about 1 each of the molecular 

25 mass markers used in Fig 1. The numbered lanes contain 33 Ml of 
selected 1.5 ml fractions eluted from the column. The NaCl 
gradient began to appear in fraction 6 and reached 300 mM at 
fraction 27. A step to 600 mM NaCl emerged between fractions 36 
and 37. Fractions 40 to 42 were eluted with 200 mM K phosphat . 

30 The major bands in the trehalose synthase preparation are 
identified on the left. Details are given in Example 9. 

Fig. 14. In vitro activation of trehalose synthase bv limited 
trvptic digestion 
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Intact trehalose synthase was incubated with (solid symbols) 
and without (open symbols) trypsin and its TPS activity 
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measured in the presence of 5 mM F6P in reaction mixtur s. 
containing (0, ) no phosphate or (■) 5 mM K phosphate pH 6,8. 
Details are given in Example 10. 

5 Fig. 15. ivimf t^d trvntic digestion of intact trehalose synthase 

Lane 1 contains the untreated trehalose synthase used in Fig. 
15 and lane 2 the same amount of enzyme after 48 min treatment 
with trypsin. Lane 3 contains molecular mass standards. The 
10 major polypeptides of trehalose synthase are identified on the 
left. 

Fig. 16. The effect of fructose 6-phosphate on the TPS activity 
of intact trehalose synthase, at different phosphate 
15 concentrations 

The TPS activity of native trehalose synthase was measured 
between zero and 10- mM F6P. Other conditions were as in the 
standard TPS assay with (•) no changes, (O) 1.3 mM K phosphate 
20 " pH 6*8 added or (1) 4 mM K phosphate pH 6.8 and 0.1 M KC1 added 
and the MgCl 2 concentration decreased to 2.5 mM. Activities are 
shown as percentages of that in the standard assay (i.e., at 5 
mM F6P and no phosphate) . 

25 Fig. 17. Activation of the TPP activities of intact and 
truncated trehalose synthase bv phosphate 

TPP activities were measured at 0.5 mM [ 14 C]-trehalose-6- 
phosphate in assay mixtures containing 50 mM Hepes pH 6.8, 1 mg 

30 bovine albumin/ml and the indicated concentrations of K 

phosphate pH 6.8 and are shown as percentages of the standard 
TPS activity. Initial rates are shown for the (I) intact and 
(•) truncated enzyme. Rates during the second five minutes of 
the accelerating reaction obtained with truncated enzyme are 

35 also shown (O) . 
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Fig. 18. Phosphate-dependenc e of the TPP activity of intact 
trehalose synthase 

The r ciprocal of the incr ase in rate (V A ) caus d by the 
5 phosphate is plotted against (0) [phosphate]' 2 or (•) 

[phosphate] " 1 . V A is shown as a percentage of the standard TPS 
activity. 

Fig. 19 , Western analyses of E. coli transformed with TSS1 and 
10 TSL1 

The gels were loaded with samples of whole homogenates (HOM) 
equivalent to 300 ± 12 fig fresh cells or 28. 000 g supernatants 
(SUP) equivalent to 340 ± 25 fig fresh cells. The letters above . 

15 the lanes indicate the cell types: K, control (HB101) cells; L, 
ALK03568 (HB101 transformed with TSL1) ; S, ALK03566 "(HB101 
transformed with TSS1) . Gel 1 was probed with anti-57K serum 
(1/20 000) and gel 2 with anti-93K serum (1/20 000) . The 
positions of the 57 kDa short chain and about 60, 36 and 35 kDa 

20 fragments of the 123 kDa long chain are shown. Molecular mass 
standards are labelled in kDas. 

Fig. 20. Plasmids containing TSS1 and TSL1 

25 pBluescript containing (a) TSSl with its own promoter, # (b) TSSl 
without its promoter and (c) TSL1 with its own promoter are 
shown. 

Fig. 21. Southern analysis of two tssl disruotants of 
30 S.cerevisiae. 

Clal digests of DNA from control yeast (S150-2B; lanes 2,5 and 
9), and two tssl disruptants, ALKO 3569 (lanes 3, 6 and 10) and 
ALKO 3570 (lanes 4,7 and 11) were probed with TSSl (lanes 2 to 
35 4) t LEU2 (lanes 5 to 7) and TSL1 (lanes 9 to 11)* Lanes 1 and 8 
contain DNA standards 
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nCTATLED DESCRIPT ION OF THE INVENTION: 



In the following description , trehalose-6-phosphate synthase 
5 (TPS) and trehalose-6-phosphate phosphatase (TPP) refer to 
catalytic activities , not to proteins, unless specifically 
stated otherwise, whereas trehalose synthase refers to a 
protein that can convert uridine diphosphoglucose (DDPG) and 
glucose-6-phosphate (G6P) into trehalose, and also exhibits as 

10 partial reactions TPS and TPP activities. TSS1, TSL1, and TSL2 
are structural genes that encode, repectively, the short (57 
kDa) and the about 130 and 99 kDa long chains of trehalose 
synthase. It is well known that mutations occur in genes and 
can cause changes in the amino acid sequence of the encoded 

15 polypeptide* Changes can also be introduced by genetic 

engineering techniques. As used herein, the term TSS1 (or TSL1 
or TSL2) gene includes all DNA sequences homologous with the 
sequences herein disclosed for TSS1 (or TSL1 or TSL2) and 
encoding polypeptides with the functional or structural 

20 properties of the 57 kDa (or about 130 kDa or 99 kDa, 

respectively) polypeptide. Sequences articially derived from 
these genes but still encoding polypeptides with the desired 
functional or structural properties are also included. 

25 The present inventors previously reported the isolation of a 
partially degraded protein preparation that contained a short 
(57 kDa) polypeptide chain and two fragments (86 and 93 kDa) of 
a long polypeptide chain and possessed both TPS and TPP 
catalytic activities [Londesborough, J & Vuorio, 0. (1991) 

30 Journal of General Microbiology 137, 323-330]. The size of the 
full-length intact long chain, from which both the 86 and 93 
kDa fragments were then believed to be derived, and whether one 
or other polypeptide possessed one or other of the catalytic 
activities were not known at that time. 

35 

The inventors have now isolated an undegraded trehalose 
synthase that contains the 57 kDa short chain, and two long 
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chains of about 130 kDa and 99 kDa as its major polypeptides. 
Trac s of oth r polypeptides are also pr sent that app ar to be 
degradation products of the about 130 and 99 kDa chains. Two 
genes, TSS1 and TSL1, that encode, respectively, the short and 
5 about 130 kDa long chains have been clon d and s quenced* 

Because the size of this long chain is now known from its gene 
to be 123 kDa, it is hereafter called the 123 kDa long chain* 
TSS1 encodes a polypeptide with a theoretical molecular weight 
of 56.2 kDa; however, this short chain and the 99 kDa long 
10 chain are still called after their apparent molecular weights 

by SDS-PAGE analysis, the error in such analyses being at least 
± 10 kDa at 130 kDa. 

The sequences of TSS1 and the polypeptide it encodes are 

15 disclosed as SEQ ID NOS:l and 2, respectively. The sequences of 
TSL1 and the polypeptide it encodes are disclosed as SEQ ID 
NOS:83 and 82, respectively (earlier versions of these 
sequences, lacking the 5'- and N-terminal regions, are listed . 
as SEQ ID NOS:3 and 4) • Genetic evidence is disclosed that 

20 shows that a functional TSS1 gene is involved in the expression 
of both TPS and TPP catalytic activities in S* cerevisiae ; (1) 
both activities are absent from a mutant strain (Klg 102) that 
lacks a properly functional TSS1 gene and does not express the 
short chain in a form recognizable in Western blots although it 

25 does express immunologically recognisable long chain; (2) 

disruption of TSS1 eliminates TPS and TPP activities, abolishes 
the short chain signal from Western blots and prevents the 
accumulation of trehalose, and these defects are simultaneously 
reversed by transformation with TSS1, which also increased the 

30 resistance of the cells to freezing stress; and (3) 

transformation of Escherichia coli with TSS1 causes a large 
increase in the TPS activity of the transf ormants (but no 
detected increase in their TPP activity) . 

35 We disclose biochemical evidence that the TPP catalytic 
activity of a truncated trehalose synthase requires a 
functional long chain: incorporation of about 1 mole of 
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14 C-N-ethylmal imide into the 93 kDa long chain fragment per 
mole of truncated trehalose synthas results in complete loss 
of TPP activity but only a slight loss of TPS activity. 
Furthermore, we have been able to is late th 99 kDa 
5 polypeptide and show that it poss sses residual TPP activity 
but no TPS activity. Also, intact trehalose synthase is 
partially resolved into a 99 kDa-enriched form with a 
relatively high TPP /TPS ratio and a 123 kDa-enriched form with 
a lower TPP/TPS ratio. However, truncation of the 123 kDa long 

10 chain has dramatic and important effects on the TPS activity of 
trehalose synthase: removal of the N-terminal 330 or so amino 
acids decreases the sensitivity of the TPS catalytic activity 
to inhibition by phosphate and almost eliminates its activation 
by fructose-6-phosphate* Further, transformation of E. coli 

15 with TSLl causes .an increase in the TPS activity of the 

trans formants (but no detected increase in their TPP activity) . 

Thus, both the short and the long chains make essential 
contributions to both the TPS and the TPP catalytic activities 
20 of trehalose synthase. The situation is therefore that there 
are at least two different structural genes for a trehalose 
synthase, neither of which can be completely described as the 
structural gene of either a trehalose~6-phosphate synthase 
protein or a trehalose-6-phosphate phosphatase protein. 

25 

We disclose that the amino acid sequences of peptides isolated 
from both the 86 and 93 kDa long chain fragments found in the 
truncated enzyme described by Londesborough & Vuorio [(1991) 
loc. cit] are encoded by TSLl.. Surprisingly, however, none of 

30 the peptides isolated from the 99 kDa polypeptide in the intact 
enzyme is encoded by TSLl. Therefore, the structural genes 
encoding polypeptides of yeast trehalose synthase include a 
third member, TSL2. The 99 kDa polypeptide encoded by TSL2 was 
not visible in SDS-PAGE analyses of truncated enzyme. However, 

35 one of the 6 peptides isolated from the 93 kDa fragment was not 
encoded by TSLl and had an amino acid sequence also found in a 
peptide isolated from the 99 kDa polypeptide. Thus, traces of a 
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d gradation product of th 99 kDa polyp ptid are pres nt in 
truncated nzym , and migrate with the 93 kDa fragment during 
SDS-PAGE. 

5 The inventors have not yet sequenced this third structural 
gene, TSL2, but disclose information that provides obvious 
methods for its isolation and cloning by a person ordinarily 
skilled in the art. Also a clone (pALK7756) comprising at least 
part of this gene has been deposited (Accession number, DSM 
10 7425; Deutsche Sammlung von Mikroorganismen und Zellkulturen, 
Mascheroder Weg 1 B, D-3300 Brauschweig) 

We disclose that the genes TSS1 and TSL1 contain extensive 
similarities such that the amino acid sequence of the entire 
15 short chain is 37 % identical to residues 495 to 814 of the 
long chain. 

A novel feature of the present invention, therefore, is that in 
order to increase the capacity of a yeast or some other host 

20 * organism for trehalose synthesis it can be necessary to 

increase the expression of both the TSS1 and the TSL1 and TSL2 
genes or modify these genes in some, other way, not because 
either TPS or TPP activity is "rate-limiting", but because more 
than one gene affects each activity. Thus, the results 

25 summarised above disclose that both TSS1 and TSL1 affect TPS 
activity and both TSS1 and TSL2 affect TPP activity. However, 
these results also disclose that the TSL2 gene product (the 99 
kDa polypeptide) isolated by chromatography is itself a 
trehalose-6-phosphatase whereas the TSS1 gene product expressed 

30 in E. coli is a trehalose-6-phosphate synthase, although the 

catalytic efficiency of these separate polypeptides can be less 
than when they are correctly assembled in a trehalose synthase 
complex. 

35 A surprising finding was that the TSS1 g ne is identical with a 
gene variously called FDP1 or CIF1. This gene has pleiotropic 
effects on the utilization of sugars by S. cerevisiae . In 
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particular, haploid yeast bearing certain alleles of this gen 
(the so-called fdpl and cifl mutants) are unable to grow on 
mannose, or on mannose or sucrose, or on mannose, sucros or 
fructose, or on mannose , sucrose, fructose or glucose, 
5 depending upon the severity of the defect [Van de Poll & 

Schamhart, (1977) Molecular and general Genetics 154, 61-66? 
Banuelos, M. & Fraenkel, D.G. (1982) Molecular and Cellular 
Biology 2, 921-929]. Such mutants grow normally on galactose. 
Therefore, during the selection of strains in which the TSS1 

10 gene has been deleted or modified it is sometimes essential and 
always advisable to grow the transf ormants on galactose, 
because in many cases the desired transf ormant will be unable 
to grow on any other common sugar, including the routinely used 
glucose. This is an unexpected methodological consideration 

15 that would not be obvious even to a person skilled in the art: 
special knowledge about the sequence and chromosomal. location 
of the TSS1 gene is required, which we now disclose. 

Since our disclosure of the identity of TSSl with FDP1 and CIF1 
20 in USPA 841,997, a confirmation has been published by Bell, W;, 
et al. [(1992) European Journal of Biochemistry 209, 951-959)] 

The inventors' previous work [Londesborough & Vuorio (1991) 
loc. cit.) showed that the TPS catalytic activity of what is 

25 now known to be trehalose synthase requires a so-called 

TPS-Activator protein, which is a dimer of 58 kDa subuttits. We 
have identified this protein by the amino acid sequences of 
peptides it contains and by its catalytic activity and disclose 
that it is yeast phosphoglucoisomerase. We disclose that 

30 fructose-6-phosphate (F6P) , which could be made by phospho- 
glucoisomerase from the glucose-6-phosphate (G6P) in the assay 
mixtures used to measure TPS activity, is a powerful activator 
of the TPS activity of intact trehalose synthase. Also, when 
the assay mixture contains an equilibrium mixture of G6P and 

35 F6P the TPS-Activator has no further effect, so that its 

phosphoglucoisomerase activity is a complete explanation of the 
activation it causes. Furthermore, the TPS activity of 
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truncated trehalose synthase does not r quire F6P, and is not 
s strongly inhibited by phosphate as is that of the native 
enzyme. Thus, a trehalose synthetic pathway can in principle be 
transferred to any organism by transforming the organism with 
5 th structural genes for yeast tr halose synthase: it is not 
necessary to simultaneously introduce the TPS-activator , 
because F6P is a ubiquitous component of cells. Furthermore, if 
the amounts of F6P in an organism are inadequate, or phosphate 
concentrations are too high, the organism can be transformed 

10 with a truncated version of TSL1 encoding the truncated long 
chain that confers insensitivity to phosphate and F6P. This 
aspect of the present invention is particularly significant, 
because it both allows the introduction of a trehalose 
synthetic pathway to organisms in which the cytosolic phosphate 

15 and F6P concentrations would prevent the efficient function of 
yeast trehalose synthase, and also may permit trehalose 
synthase to function efficiently at stages of yeast growth -when 
native trehalose synthase would be inhibited by cytosolic 
phosphate. We disclose that intact trehalose synthase can be 

20 liberated from phosphate inhibition by treatment with trypsin 
in vitro . 

From the knowledge gained from the present invention, it is 
possible to produce trehalose recombinantly by transforming a 
25 host cell with the appropriately modified TSS1, TSL1 and TSL2 
genes. Methods of transformation and appropriate expression 
vectors are well-known in the art. 

Expression vectors are known in the art for both eukaryotic and 
30 prokaryotic systems, and the present invention contemplates use 
of both systems. For transformation of yeast at least two 
classes of promoters are contemplated. Yeast that accumulates 
more trehalose but at the usual time (i.e., after consumption* 
of fermentable carbon sources) can be made by inserting extra 
35 copies of the genes under their own promoters, or stronger 

promoters with similar control. Such yeast can hav improved 
storage properties and stress resistance and be more conomic 
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sources of trehalose. Y ast that synthesiz s trehalose during 
fermentation can be made by replacing the genes' own promoters 
with promoters (such as ADH1) that are active during 
fermentation. As explained above, such y ast can have increased * 
5 fermentation rat s f ethanol yi Ids and r sistance to osmotic 

and temperature stress during fermentation. ? 

Also contemplated are modifications of the DNA sequence which 
would provide "preferred" codons for particular expression 

10 systems (e.g. , bacteria and higher plants) . In addition, the 
TSS1, TSL1 and TSL2 DNA sequences may be modified by certain 
deletions or insertions, provided the translated polypeptides 
are enzymatically functional. Expression of functional 
polypeptides from TSS1, TSL1 and TSL2 may be confirmed by 

15 assaying for TPS and/ or TPP activity in the expression system 
by the methods described in Londesborough and Vuorip £(1991) 
loc. cit.]. Deletion of the first 330 amino acids or so from 
the 123 kDa long chain to give an enzyme active at higher 
phosphate and lower F6P concentrations has already been 

20 mentioned. 

The genes of the present invention may be transferred and 
expressed in plants by using the Ti plasmid system which is 
well known in the art. The internal transforming genes of a 

25 cloned T-DNA can be removed by recombinant DNA techniques and 

replaced by the genes of the present invention and expressed in 
plant tissues. Commonly, the coding sequence of the foreign 
gene (for instance, TSL1) is substituted for the coding region 
of the opine synthetase gene. In this way, the natural promoter 

30 and polyadenylation signals of the opine synthetase gene confer 
high- level expression of the foreign protein. Any method known 
in the art, however, may be used to transform higher plants 
with the genes of the present invention. 

35 The following examples are for illustration of the present 

invention and should not be construed as limiting the present 
invention in any manner. 
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Gen ral Methods and Materials, 

Materials . Fructos 6-phosphate (F6P) and adenosine 5'- 
5 diphosphoglucos (ADPG) were from Sigma Chemicals. Glucose 
6-phosphate (66P) , phenylmethylsulphonyl fluoride (PMSF) , 
uridine 5 ' -diphosphoglucose (UDPG) and other commercial 
reagents were from the sources stated in Londesborough & Vuorio 
[(1991) loc. cit.]. Truncated trehalose synthase (proteolytic- 

10 ally activated M TPS/P M ) and TPS activator were prepared as 
described in Londesborough & Vuorio [(1991) loc. cit.]. The 
antisera, anti-TPS/P, anti-57K and anti-93K were made in 
rabbits using as antigen, respectively, truncated trehalose 
synthase, the short (57 kDa) chain and the 93 kDa fragment of 

15 the long chain of trehalose synthase as described in 
Londesborough & Vuorio [(1991) loc. cit.]. 

Buffers for enzvme extraction and purification . Two standard 
cocktails, HBMED (25 mM Hepes/KOH pH 7.0/1 mM benzamidine/1 mM 

20 MgCl 2 /0*l mM EDTA/1 mM dithiothreitol) and HB2M1ED (HBMED but 
with final concentrations of 2 mM MgCl 2 and 1 mM EDTA) were 
used as basal buffers during preparation of cell extracts and 
purification of enzyme. Where indicated, the Hepes and 
benzamidine concentrations were increased to 50 mM and 5 mM, 

25 respectively. 

Yeasts . Commercial baker's yeast was from Alko's RajamSki 
factory. The standard laboratory strains of S. cerevisiae us d 
were X2180 (ATCC 26109) and S288C (ATCC 26108). Mutant strains 
are described in the Examples and Table 1 lists important 

30 strains of microorganisms and plasmids. Laboratory yeast .were 

routinely grown on 1 % yeast extract/2% peptone (YP) containing 
the indicated carbon source in aerobic shake flasks at 30 °C 
and 200 r.p.m. Cells were harvested by centrifugation for 5 
minutes at 3000 g, resuspended in distilled water and again 

35 centrifuged 5 minutes at 3000 g. The pellets were suspended in 
about 20 volumes of HB2M1ED and centtifuged in tared tubes for 
10 minutes at 15,000 g. Tubes and pell ts were weigh d to give 
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the mass of fresh yeast. For trehal se determinations , portions 
of the pellets were treated as described by Lillie, S.H. & 
Pringle, J.R. [(1980) Journal of Bacteriology 143, 1384-1394). 
The washed cells were broken by suspending them at 0 °C in 1 to a 
5 4 volumes of HB2M1ED, adding fresh stock PMSF/pepstatin (1 mg 

pepstatin A/ml 0.1 M PMSF in methanol) to give final , 
concentrations of 10 tig pepstatin/ml and 1 mM PMSF, and shaking 
with glass beads for three 1 minute periods in a Braun MK XI 
homogenizer or (for amounts less than 0.3 g fresh yeast) "by 

10 vortexing in an Eppendorf tube. The glass beads were removed 

and the volume of homogenate was measured. Samples for SDS-PAGE 
were made at once by dilution with Laemmli sample buffer 
[Laemmli, U.K. (1970) Nature, London 227, 680-685]. The 
homogenates were then centrifuged as indicated (usually 5 min 

15 at 5,000 g or 20 minutes at 28,000 g) . Enzyme assays were made 
on the homogenates and supernatants and protein determined in 
the supernatants from A280 and A260 measurements. 

20 . Table 1. List of important strains" and plasmids 

Name Description Source 

Saccharomvces cerevisiae 

25 

X2180 (ATCC 26109) Standard laboratory yeast (diploid) 

S288C (ATCC 26108) Standard laboratory yeast (haploid) 

30 Klg 102 cif 1-102, leul, ural, trp5, MATa l 

MV6807 fdpl f leu2, ura3, his3 r lys2, ade8, 2 

trpl, MATa *> 

35 S150-2B leu2, his3, trpl, ura3 f Mata 

ALK03569 tssl::LEU2 (from S150-2B) This work 



ALKO3570 
WDC-3A 

Escherichia coli 
HB101 (ALKO 683) 
ALK03566 
ALK03568 
Plasmids 

PALK751 (DSM 6928) 

PALK752 

PALK753 

PALK754 

PALK756 (DSM 7425) 
PALK757 



25 

tssl::LEU2 (from S150-2B) . This work 

cifl::HIS3, his3, ura3, ade2, MATo 3 



HB101 containing pALK752 This work 

HB101 containing pALK754 This work 



pBluescript containing an 8.. 2 kb This work 
insert comprising TSL1 

pBluescript containing a 2.5kb This work 
insert comprising TSSl 

pBluescript containing a 3.3 kb This work 
insert comprising the ORF of TSSl 

pBluescript containing a 4.4 kb This work 
insert comprising TSL1 

pBluescript containing a 3.5 kb This work 
insert comprising at least part 
of TSL2 

pBluescript containing an This work 

insert comprising the ORF of TSLl 



pMB14 



YEp352 containing CIF1 
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Sources: 1. Dr. D. Fraenkel, Harvard Medical School, U.S.A. 



2. Dr. J. Thevelein, Lab voor Plant nbioch. , Heverlee, 
Belgium. « 

3. Dr. C. Gancedo, CSIC, Madrid r Spain. \ 



Enzyme Assays . TPP and TPS standard assays and other kinetic 
measurements were made as described by Londesborough & Vuorio 
10 [(1991) loc. cit.] except that the standard TPS assay mixture 
contained 5 mM F6P unless stated otherwise. Where appropriate, 
TPS assays were corrected by measuring UDP production from UDPG 
in the absence of G6P and F6P. 

15 DNA manipulations * Stratagene's (La Jolla, California) 

Escherichia coli strain XL-1 Blue {recAl, endAl, gyrA96 f thi, 
hsdR17, supE44,* relAl r lac, [F' proAB f laclq ADM15, TnlO 
(tetR)J} were used as host bacteria. When needed, XL-1 Blue 
cells were made competent by the method of Mandel & Higa (1970) 

20 Journal of Molecular Biology 53 , 159-162]. The cloning vector 
was Stratagene's Lambda Zap II , predigested with EcoRI, where 
the cloning site is near the N-terminus of the gene for 
S-galactosidase, thus enabling the color selection of 
recombinant clones. The sequencing vectors M13mpl8 and M13mpl9 

25 from Pharmacia LKB Biotechnology were also used. 

High molecular mass DNA from the haploid S288C strain was 
prepared as described Johnston, J.R. [(1988) in Yeast, A 
Practical Approach, IRL Press, Oxford] and partially digested 

30 with either Haelll or EcoRI restriction enzyme. For the large 

scale Haelll digestion, e.g., a reaction mixture of 330 /il * 
containing 30 /Ltg of DNA and 4.8 U of enzyme was incubated at 37 
°C for 60 minutes. The reaction was stopped with 10 pi of 0.5 M * 
EDTA and transferred to ice. The methods for such digestions 

35 and their agarose gel electrophoretic analysis are well known 
in the art and are described, e.g., in Sambrook et al. , 
Molecular Cloning, A Laboratory Manual [Cold Spring Harbor 
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Laboratory Press, 2nd d. , (1989)]. 

Plasmid DNA was isolated using standard methods for small scale 
purif ication Sambrook et ah [(1989) Molecular Cloning, A 
5 Laboratory Manual, 2nd d. , Cold Spring Harbor Laboratory 

Press, hereby expressly incorporated by reference] . Large scale 
purifications of plasmid DNA were done with Qiagen tip-100 
columns from Diagen following their instructions. 

10 DNA sequences were determined either manually by the dideoxy- 
chain termination method [Sanger et al. (1977) Proceedings of 
the National Academy of Sciences U.S.A. 74, 5463-5467], 
sequencing directly from pBluescript plasmids, or automatically 
with the Applied Biosystems Model 373A automatic DNA sequencer, 

15 sequencing either directly from these plasmids or from M13 
subclones. * 

Southern and Western hybridizations and other standard 
manipulations were carried out by well known procedures [see, 
20 e.g., Sambrook et al. (1989) loc. cit.]. 

Example 1, Purification of intact trehalose synthase 

Intact trehalose synthase was purified from commercial baker's 
25 yeast. The method described by Londesborough & Vuorio [(1991) 
loc. cit*] for purification of "proteolytically activated 
TPS/P" was modified as follows: 

1. All buffers contained 2 mM MgCl 2 and 1 mM EDTA. This 

30 increased yields in the early steps and probably helped to 
decrease proteolysis in the later steps. 

2. In the first ammonium sulphate fractionation, the EDTA 
concentration was increased to 2.5 mM before addition of 

35 ammonium sulphate. 



3. All buffers were adjusted to between 0.4 and 1 mM PMSF and 
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betw en 4 and 10 iiq pepstatin A/ml by addition, immediately 
before use, of the appropriate amount of a freshly prepared 
stock solution containing 1 mg pepstatin A/ml 0.1 M PMSF in 
methanol (called,, stock PMSF/pepstatin) . When, as in 
5 chromatography, buffers were used for several hours, more stock 
PMSF/pepstatin was added at intervals, but so as not to exceed 

1.5 % methanol in the buffer, or a fresh lot of buffer was 
taken into use, because of the short half-life of PMSF in 
aqueous solution. All columns were equilibrated with at least 

10 one bed volume of buffer containing PMSF and pepstatin A 
immediately before application of enzyme. 

4* Experience permitted the enzyme-containing fractions (a 
total of 17.8 ml in the preparation of Table 2) from 

15 Heparin-Sepharose to be identified as soon as they were eluted. 
Stock PMSF/pepstatin (150 and 0.1 M EDTA (200 fil) were 
immediately added to them. Then 7.2 g of powdered ammonium 
sulphate was slowly added (over 20 min) . After 30 min 
equilibration, the mixture was centrifuged 15 min at 28,000 g- 

20 The pellets were packed for 5 min at 28,000 g and expressed 
buffer was removed with a pasteur pipette. The pellets were 
dissolved to 2.0 ml in HB2M1ED containing 0.8 mM PMSF and 8 /xg 
pepstatin A/ml, centrifuged 5 min jat. 28,000 g and applied to a 

2.6 x 34 cm column of Sepharose 6B freshly equilibrated with 
25 HB2M1ED containing 50 mM NaCl, 0.4 mM PMSF and 4 fig pepstatin 

A/ml. The interval between elution from Heparin-Sepharose and 
application to Sepharose 6B was 5 h. In the Londesborough & 
Vuorio [(1991) loc. cit.] procedure, the Heparin-Sepharose 
eluates were stored at about 3 °C, without addition of PMSF or 
30 pepstatin A, for 5 days before the second ammonium sulphate 
fractionation and application to Sepharose 6B. 

5. Fractions (3.7 ml) from the Sepharose 6B column were 
immediately mixed with 20 /xl of stock PMSF/pepstatin and then 
35 assayed. Again, experience permitted the correct fractions to 
be pooled, based on activity and A280 measurements without 
SDS-PAGE analysis, and immediately applied to a 0.7 x 7 cm 
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column of UDP-Glucuronat -Agarose equilibrated with HB2H1ED 
containing 50 mM NaCl, 0.4 mM PMSF and 4 /xg pepstatin A/ml. The 
enzyme was eluted as described by Londesborough & Vuorio 
[(1991) loc. cit.] and 10 m! of stock PMSF/pepstatin added to 
5 each 1.7 ml fraction. Each fraction was divided into three. Two 
portions were stored at -70 °C and one at 0 °C. 

Table 2 summarizes a purification and Fig. 1 shows the SDS-PAGE 
analysis of fractions eluted from UDP-Glucuronate-Agarose. No 

10 obvious differences were apparent between enzyme eluted by 0.2 
M NaCl and that eluted by io mM UDPG/0.4 M NaCl. The major 
bands present had molecular masses of 57, 99 and 123 kDa. 
Several weaker bands were present between the 123 kDa band and 
about 90 kDa. In Western analyses the 123 kDa, 99 kDa and most, 

15 if not all, of the fainter bands in this region were recognized 
by the anti-TPS/P and anti-93K sera. This suggests that the 
fainter bands are partially degraded long chains. The weak 
bands at 68 kDa also reacted with the anti-93K serum, but could 
be removed by chromatography on DEAE-cellulose (see Example 9) • 

20 When the antibodies from anti-93K serum that bound to the 99 
kDa band were eluted from a nitrocellulose blot [as described 
by Pringle, j.R. (1991) Methods in Enzymolgy 194, 565-590] and 
used to probe another blot, they bound also to the 123 kDa 
band, showing that the two long chains of trehalose synthase 

25 have epitopes in common. 

Intact enzyme binds less tightly to the XIDP-Glucuronate-Agarose 
than the truncated enzyme purified by Londesborough & Vuorio 
[(1991) loc. cit. ] and the proportion of enzyme remaining bound 

30 at 0.2 M NaCl varied from preparation to preparation. When #9 
of Table 2 was re-run on the same column, 76 % of the TPS 
activity was again recovered at 0.2 M NaCl (and 25 % by 0.4 M 
NaCl/ 10 mM UDPG) f so that overloading of the column is not the 
reason why this enzyme eluted at 0.2 M NaCl. However, when 

35 enzyme eluted at 0.2 M NaCl was truncated with trypsin as 

described in Example 10, it then bound to the column at 0.2 M 
NaCl and was only recovered at 0.4 M NaCl/10 mM UDPG. Thus, as 
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well as altering the kinetic properties of the enzyme (s e 
Examples 10 & 12) , this truncation also increases the affinity 
for UDP-Glucuronate-Agarose. Presumably there are subtl 
differ nces in factors such as th amount of adventitious 
5 proteolysis and stat of aggr gation between enzyme elut d at 
0.2 M NaCl and that remaining bound. For the preparation 
summarised in Table 2, the ratio of standard TPP and standard 
TPS activities increased from 22 % in #9 to 39 % in #14,. 
showing -that there are differences, even though they could not 
10 be clearly detected by SDSHPAGE. 

These findings disclose that a highly purified trehalose 
synthase containing a 57 kDa short chain , a 123 kDa long chain 
and a 99 kDa polypeptide that is recognised by the anti-93K 

15 serum possesses both TPS activity activatable by TPSActivator 
protein (or F6P) and TPP activity. The rate of -hydrolysis of 1 
mM G6P in either phosphate or Hepes buffer was less than 1 % of 
that of 0.5 mM trehalose-6-phosphate, so that the TPP activity 
is highly specific. An unexpected finding is that this highly 

20 purified preparation contains the 99 kDa polypeptide , which is 
not present in the purified truncated trehalose synthase. It is 
disclosed later that this polypeptide is not a degradation 
product of the long (123 kDa) chain, whereas both the 86 and 93 
kDa polypeptides of truncated enzyme contain amino acid 

25 sequences that identify them as fragments of the long (123 kDa) 
chain. This novel preparation possesses some unexpected 
catalytic properties , which are described in more detail in 
Example 11. 

30 Table 2. Purification of intact trehalose synthase 

The preparation is from 60 g of pressed baker's yeast. TPS 
activities "Without Activator" were measured as described by 
Londesborough & Vuorio [(1991) loc. cit.], i.e. r in the absence 
35 of F6P. Assays "With Activator" were det rmined similarly but 
in the presence of a saturating amount of pure TPS activator 
(similar values were obtained when some fractions were later 



assayed in th presenc of 5 mM F6P instead of TPS activator, 
and are shown in parentheses) . ND, not determined. 



Faction VolW* Without Activator With Activator 

iffiH nZml a/ma Total n a/ml U/mo Total U 

1st. (NH 4 ) 2 S0 4 Precipitate 13.4 58 1.0 810 ND ND ND 15 

G25 eluate 22.2 30 1.1 668 ND ND ND 

Heparin- Sepharose eluate 18.2 ND ND ND «21 *ll »380 

Sepharose 6B eluate 26 1.4 5.1 36 4.7 17 121 
ODP-glucuronate agarose eluates: 
at 0.2 M NaCl 

* 9 1.7 4.6 3.1 - 11.5(12) 12 
» 1° 1.7 ND ND ND 12.2 21 

* 11 1.7 ND ND ND 6.3 23 58 

* 12 ' 1-7 ND ND ND 3.9(3.3) 22 
at 0.4 M NaCl/10 mM UDPG 

* 13 1-7 2.1 - - 5.9(6.2) 25-30 a 

* 14 1-7 3.7 - - 9.3 25-30 a 

27 

#15 1.7 ND - - 0.8 



a Based on protein .contents estimated from Coomassie blue- stained SDS- PAGE 
gels • 

b Results from other preparations show that the activity with 

excess TPS-activator (or 5 mM F6P) is not, at this step, more 

than 10 % greater than that without activator. 

Example 2 Increased expression bv S. ce revisias of the long and 
short chains of trehalose synthase af ter consumption of glucose 

Three 500 ml lots of YP/2 % glucose in i 1 shake flasks were 
each inoculated with 1 ml of a suspension of X2180 cells of 
A600 1.0 and shaken at 200 r.p.m. at 30 °C. At the times shown 
in Table 3, the cells were harvested, broken and analyzed as 
described in General Materials and Methods. The 28,000 g 
supernatants were stored for a week at -18 °c, thawed and 
re-centrifuged for 20 min at 28,000 g. Portions of 150 ftl (each 
equivalent to 53 mg of fresh yeast) were mixed with 30 /il of 
anti-TPS/P serum, equilibrated for 30 min at O «C and 
centrifuged for 10 min at 10,000 g. The pellets were washed 
with 250 Ml of HBMED and then dissolv d in Laemmli sample 
buffer and subjected to SDS-PAGE (Fig 2). Bands at 57, 99 and 
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123 kDa were strong in the sample (C) from stationary phase 
yeast and in the sample (B) harvested immediately after 
disappearance of glucose from the medium, but were absent or 
very weak in the sample (A) from yeast growing in the presence 
of 1.2 % glucose. 



Table 3 . Appearance of TPS and TPP activities in X2180 yeast 
grown on YP/2 % glucose. 

Enzymes were assayed in the 28 , 000 g supernatants . 





A 


B 


C 


i 


Age (h) 


16.1 


18.1 


39. 


0 


Residual glucose 


1.2 


<0.001 


£0. 


001 


(g/100 ml medium) 










Fresh yeast mass 


7.6 


14.8 


29. 


5 


(mg/ml medium) 










Trehalose 


0.73 


3.1 


94 




(mg/g dry yeast) 










TPS (TT/g fresh yeast) 


1.2 


7.4 


10. 


5 


TPP (U/g fresh yeast) 


0.29 


2*. 2 


3. 


0 ' 


TPP/TPS (%) 


24 


30 


29 





Control experiments (not shown) indicated that pre- immune serum 
did not precipitate the 57 , 99 and 123 kDa bands, and that 
using 50 Ml of serum instead of 30 /xl did not. precipitate more 
of these three bands from the C sample. 

These results disclose that the co-ordinate, 7-fold increase in 
TPS and TPP activities that occurs during less than 2 h when 
glucose disappears from the medium is accompanied by increas s 
in the amounts in yeast of three polypeptides, of mass 57, 99 
and 123 JcDa, that are immunoprecipitated by anti-TPS/P serum. 
These polypeptides are those f ound in the intact trehalose 
synthase purified in Example 1. Thus, increase in the amount of 
enzyme protein is a major mechanism by which the capacity of 
yeast to synthesize trehalose is increased. 
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Example 3 Determination of the N-termin al amino acid secru nces 
of peptides isolated from the various polypeptides of trehalose 
synthase 

5 The 57 , 86 and 93 kDa polypeptides of the truncated trehalose 
synthase were separated by SDS-PAGE, digested on nitro- 
cellulose blots and fractionated by HPLC as described by 
Londesborough & Vuorio [(1991) loc. cit.]. Also, these 
polypeptides and polypeptides of molecular mass 57 , 99 and 123 

10 kDa immunoprecipitated from yeast extracts as described in 

Example 2 were separated by SDS-PAGE and digested in the gel 
with lysylendopeptidase C as described by Kawasaki, H. , Emori, 
Y. and Suzuki, K. (in press) . The derived peptides were . 
separated by HPLC using a DEAE pre-column before the reverse- 

16 phase column essentially as described by Kawasaki et al [(1990) 
Analytical Biochemistry 186, 264-268]. The 99 kDa polypeptide 
isolated by chromatography on phosphocellulose in trie absence 
of triton (see Example 9) was digested with lysylendopeptidas 
. c and the peptides separated by HPLC. In all cases, isolated 

20 peptides were sequenced in a gas-pulsed liquid phase sequencer 
as described by Kalkinen, N. & Tilgman, C [(1988) Journal of 
Protein Chemistry 7, 242-243], the released PTH-amino acids 
being analysed by on-line , narrow-bore, reverse-phase HPLC. The 
sequences are shown in Table 4. 

25 

Table 4. N-terminal amino acid sequences of peptides isolated 
from f fragments of! the polypeptides of trehalose synthase. 

When two sequences were obtained from the same HPLC peak, they 
30 are shown as a and b sequences, where possible according to the 
sequences predicted from the genes. Tentative identifications 
from the amino acid sequencer are shown by the one letter codes 
followed by double queries, Unidentified residues are shown by 
Xaa. (In the Sequence Listings, also tentatively identified 
35 residues are indicated as Xaa) . The location of each amino acid 
sequence in the short (S) and long (123 kDa) (L) chains of Figs 
3b and 4b is shown below the sequence. 
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Short (57) chain peptides 



Trvptic peptides from bl ots of the 57 kDa polypeptide from 
truncated trehalose synthase . 

848 Tyr-Ile-Ser-Lys 

(SEQ ID NO: 5) (S 463-66) 

850 Asp-Val-Glu-Glu-Tyr-Gln-Tyr-Leu-Arg 
(SEQ ID NO: 6) (S 333-41) 

859 His-Phe-Leu-Ser-Ser-Val-Gln-Arg 
(SEQ ID NO: 7) (S 223-30) 

862a Val-Leu-Asn-Val-Asn-Thr-Leu-Pro-Asn-Gly-Val-Glu- 
Tyr-Gln 

(SEQ ID NO:8) (S 231-44) 

862b Ser-Val-Val-Asn-Glu-Leu-Val-Gly-Arg 
(SEQ ID NO: 9) (S 342-50) 

863 Leu-Tyr-Lys 

(S 460-2) 

864 Glu-Thr-Phe-Lys 

(SEQ ID NO: 10) (S 280-3) 

866 Leu-Asp-Tyr-Ile-Lys 

(SEQ ID NO: 11) (S 294-8 > 



870 



Ile-Leu-Pro-Val-Arg 

(SEQ ID NO: 12) (S 196-200) 



From lvsvlendopeptidase C digests of immunoprecipitated 57 kDa 
band 



966a Glu-Val-Asn-Xaa-Glu-Lys 

(SEQ ID NO: 13) (S 454-9) 

966b Phe-Tyr-Asp-Xaa-L?? 

(SEQ ID NO: 14) (not found) 

980 Leu-Xaa-Ala-Met-Glu-Val-Phe-Leu-Asn-Glu-Xaa-Pro-Glu 
(SEQ ID NO: 15) (S 304-16) 

981 Tyr-Thr-Ser-Ala-Phe-Trp-Gly-Glu-Asn-Phe-Val-Xaa- 
Glu-Leu 

(SEQ ID NO: 16) (S 467-80) 

987 Phe-Gly-Xaa-Pro-Gly-Leu-Glu-Ile-Pro. 
(SEQ ID NO: 17) (S 63-71) 



Long f!23 kDal chain peptides 

Tryptic peptides from blots of the 86 and 93 kDa fragments . 

889 D??-Gly-Ser-Val-Met-Gln 

(SEQ ID NO: 18) (L 587-592) 

890/891 Leu-Pro-Gly-Ser-Tyr-Tyr-Lys 
(SEQ ID NO: 19) (L 917-23) 

8 92 a Ala-I le-Val-Val-Asn-Pro-Met-Asp-Ser-Val-Ala 

(SEQ ID NO: 20) (see peptide 1299) 



892b 



Met-Ile-Ser-Ile-Leu 

(SEQ ID NO: 21) (L 842t7) 
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From lvsvlendooeptidase digest of combined 86 and 93 kDa 
fragments . 



1171 Arg-Arg-Pro-Gln-Trp-Lys 

(SEQ ID NO: 22) (L 770-5) 



From lvsvlendooeptidase digest of the 86 kDa fragment. 

10 1479 Thr-Leu-Met-Glu-Asp-Tyr-Gln-Ser-Ser-Lys 

(SEQ ID NO: 52) (L 816-26) 

1483a Ala-Phe-Glu-Asp-His-Ser-Trp-Lys 
(SEQ ID NO: 78) (L 445-52) 

15 

1483b Ala-Gly-His-Ala-Ile-Val-Tyr-Gly-Asp-Ala-Thr-Ser-Thr- 
Tyr-Ala-Lys 
(SEQ ID NO: 79) (L 1064-79) 

20 1481 Glu-Arg-Leu-Pro-Gly-Ser-Tyr-Tyr-Lys 

(SEQ ID NO 80) (L 914-23) 

From lvsvlendooeptidase digest of the 93 kDa fragment. 

25 1480 Thr-Leu-Ket-Glu-Asp-Tyr-Gln 

(SEQ ID NO: 81) (L 816-23) 

1484a Ala-Phe-Glu-Asp-His-Ser-Trp-Lys 
(SEQ ID NO: 78) (L 445-52) 

30 

1484b Ala-Gly-His-Ala-Ile-Val-Tyr-Gly-Asp-Ala-Thr-Ser- 
Thr-Tyr -Ala -Ly s 
(SEQ ID NO: 79) (L 1064-79) 

35 1485 Glu-Arg-Leu-Pro-Gly-Ser-Tyr-Tyr-Lys 

(SEQ ID NO: 80) (L 914-23) 
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From lvsv lendopeptidase digests of immunoprecipitated 124 kDa 
band 



1047 



Ser-D? ? -Pr o-Gln-Ly s 

(SEQ ID NO: 23) (not found) 



10 



1048 Phe-Tyr-Arg-Asn-Leu-Asn-Gln-Arg-Phe-Ala-Asp-Ala« 
Ile-Val-Lys 
(SEQ ID NO: 24) (L 453-67) 

1054a Asp-Gly-Ser-Val-Met-Gln-W??-Xaa-Gln-Leu-I?? 
(SEQ ID NO: 25) (L 587-97) 



1054b Asn-Ala-Ile-Asn-Thr-Ala-Val-Leu-Glu-Asn-Ile-Ile- 
15 Pro-H??-Xaa-H??-Val-Lys 

(SEQ ID NO: 26) (L 360-77) 



20 



1061 



1063 



Leu-Val-Asn-Asp-Glu-Ala-Ser-Glu-Gly-Gln-Val-Lys 
(SEQ ID NO: 27) (L 1052-63) 

V? ? -Gln-Asp-Ile-Leu-Leu-Asn-Asn-Thr-Phe-N?? 
(SEQ ID NO: 28) (not found) 



25 



1375 



Phe-Leu-Val-Glu-Asn-Pro-Glu-Tyr-Val-Glu-Lys 
(SEQ ID NO: 50) (L 629-39) 



1376 



R??-Ile-Thr-Pro-His-Leu-Thr-Ala-Xaa-Ala-Ala 
(SEQ ID NO: 51) (L 245-55) 



30 



1377 



Thr-Leu-Met-Glu-Asp-Tyr-Gln-Ser-Ser-Lys 
(SEQ ID NO: 52) (L 816-26) 



35 



1378-1 Ile-Leu-Glu-Gly-Leu-Thr-Gly-Ala-Asp-Phe-Val-Gly- 
Phe-Gln-Thr 

(SEQ ID NO: 53) (L 521-35) 
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1378-11 Gln-Il -Leu-Xaa-Pro-Thr-L u-Xaa-Tyr-Gln-Ile-Pro- 
Asp-Asn 
(SEQ ID NO: 54) (L 427-40) 

5 1380 Phe-Gly-Gly-Tyr-Ser-Asn-Lys 

(SEQ ID NO: 55) (L 319-25) 

13 8 1 Phe-Xaa-Thr-Glu-Asn-Ala-Glu-Asp-Gln-Asp-Xaa-Val- 
Ala-Xaa-Val-Ile-Gly-G? ? -Ala-I le-Xaa-Xaa-Ile 
10 (SEQ ID NO: 56) (L 931-53) 

13 82 Xaa-Val-Gly-Thr-Val-Gly-Ile-Pro-Thr-Asp-Glu-Ile- 
Pro-Glu-Asn-Ile-Leu-Ala 
(SEQ ID N0:57) (L 378-95). 

15 

The 99 kDa polypeptide 

From lysylendopeptidase digests of immunoprecipitated 99 kDa 
band 

20 

959 Asp-Thr-Thr-Gln-Thr-Ala-Pro-Val-T??-Asn-Asn-Val- 
Xaa-Pro 
(SEQ ID NO: 29) 

25 961 Asn-Gln-Leu-Asp-Ala-A??-Asn-Tyr-Ala-Glu-Val . 

(SEQ ID NO: 30) 

1002a Asn-Leu-Ser-Arg-Trp-Arg-Asn-Tyr-Ala-Glu 
(SEQ ID NO:31) 



30 



1002b Trp-Gln-Gly-Lys 
(SEQ ID NO: 32) 



1043 

35 



Ile-Gln-Leu-Gly-Glu-Ser-Asn-Asp-Asp-D??-L?? 
(SEQ ID NO: 33) 
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Glu-Val-Pro-Thr-Il -Gln-Asp-Xaa-Thr-Asn-Lys 
(SEQ ID NO: 34) 

Xaa-Tyr-Xaa-Tyr-Val-Lys 
(SEQ ID NO: 35) 

Asn-Gln-Leu-Gly-Asn-Tyr 

(SEQ ID NO: 36) . 

Val-Ala-Leu-Thr 
(SEQ ID NO: 37) 

Asp-Ala-Ile-Val-Val-Asn-Pro-Xaa-Asp-Ser-Val-Ala 
(SEQ ID NO: 38) 

• Ser-Leu-Leu-Asp-Ala-Gly-Ala-Lys 
(SEQ ID NO: 44) 

Glu-Lys-Pro-Gln-Asp-Leu-Asp-Asp-Asp-Pro-Leu-Tyr- 
Leu-Thr 
(SEQ ID NO: 45) 

D??-Gln-Xaa-His-Gln-Asp-Xaa-Xaa-Asn-Leu-Thr 
(SEQ ID NO: 46) 

Phe-Asn-Asp-Glu-Ser-Ile-Ile-Ile-Gly-Tyr-Phe-P??- 
Xaa-Ala-Pro 

(SEQ ID NO: 47) 

Ser-Arg-Leu-Phe-Leu-Phe-Asp-Tyr-Asp-Gly-Thr-Leu- 
Thr-Pro 
(SEQ ID NO: 48) 
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From Ivsvlendooeptidase digest of 99 kDa protein purified on 
phosphocellulose 

1451 Gln-Leu-Gly-Asn-Tyr-Gly-Phe-Tyr-Pro-Val-Tyr 
5 (SEQ ID NO:49) n 



Apart from peptide 966b, all the amino acid sequences 

10 determined from the short chain samples have been located in 
the protein sequence deduced from the TSS1 gene (see Figure 
3b) . Apart from peptides 892a, 1047 and 1063, all the amino 
acid sequences determined from the 86 and 93 kDa fragments of 
• the long chain and from the intact 123 kDa long chain itself 

15 have been located in the protein sequence deduced from TSL1. 

The HPLC profiles obtained from digests of the 86 kDa fragment 
were essentially identical with those from digests of the 93 
kDa fragment when either trypsin or lysylendopeptidase C was 
used (not shown) • Also, corresponding HPLC peaks from 86 and 93 

20 kDa digests yielded the same sequences or double sequences 

(peptide pairs 890 & 891; 1479 & 1480; 1483a, b & 1484a, b; 1481 
& 1485) . These results disclose that both the 86 and 93 kDa 
polypeptides in truncated enzyme are derived from the 123 kDa 
long chain encoded by TSL1. In particular, it is not the case 

25 that one or other of these fragments is derived from the 99 kDa 
polypeptide, although contamination with minor amounts of 
(degradation products of) that polypeptide is probable (see 
below) . 

30 None of the 16 amino acid sequences obtained from the 99 kDa 

polypeptide is encoded by TSL1. The first 5 residues of peptide 
1451 from the 99 kDa polypeptide purified on phosphocellulose 
are identical with the last 5 residues of peptide 1297a from 
immunoprecipitated 99 kDa polypeptide. This confirms that the 

35 99 kDa polypeptide immunoprecipitated by anti-TPS/P serum from 
yeast extracts is the same as the 99 kDa polypeptide in 
purified intact enzyme. These results disclose that the 99 kDa 
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polypeptide is not encoded by TSL1 (or TSS1) but by another 
g ne, which the inventors call TSL2 . 

The origin of peptides 1047 and 1067 found in the digest of the 
5 intact (123 kDa) long chain is not known. The only peptide from 
the long chain fragments of truncated enzyme not encoded by 
TSLl is 892a from the 93 kDa fragment. This is identical with 
the last 11 residues of peptide 1299 from the 99 kDa 
polypeptide. This suggests that the 93 kDa band was 

10 contaminated with some material derived from the 99 kDa 

polypeptide , although this polypeptide itself was not visible 
in SDS-PAGE analyses of the truncated enzyme. The identical 
HPLC profiles of digests of the 86 and 93 kDa fragments and the 
fact that only one peptide derived from the 99 kDa polypeptide 

15 was identified in these digests shows that the contamination 

was at a low level. This discloses that a functional truncated 
trehalose -synthase with both TPS and TPP activities probably 
requires only polypeptides encoded by TSS1 and TSLl. 

20 Example 4 Cloning and sequencing of TSS1 

(a) Preparation and screening of a yeast genomic DNA library 

A genomic library was constructed in the bacteriophage lambda 
vector, Lambda Zap II , using a partial Haelll digest of iL. 

25 cerevisiae strain S288C chromosomal DNA, according to 

Stratagene's Instruction Manual for the Zap-cDNA synthesis kit. 
The DNA from the ligation reaction was packaged into Giga II 
Gold packaging extract (Stratagene) according to the 
manufacturer's instructions (1990). The titer of the 

30 recombinants was determined on Luria broth plates containing 
X-B-galactoside (5-bromo-4^chloro-3-indoyl-fl-D-galacto- 
pyranoside) as a chromogenic substrate for 6-galactosidase and 
IPTG (isopropyl B~D-thiogalactopyranoside) as an inducer* 
About 50,000 recombinants were amplified on large (150 mm) 

35 NZY-plates according to Stratagene' s instructions. The titre of 
the resulting library was 5 x 10 9 pfu/ml with a total of 150 
ml. 
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Several positive clones were found by screening with anti-TPS/P 
serum. After thre rounds of purification, all clon s w re 
positive. They were screened again, now with anti-57K serum. 

5 For further manipulations of DNA, th plasmid part, 

pBluescript, of the Lambda Zap vector was excised as described 
in the manual for Predigested Lambda ZapII/EcoRl Cloning Kit 
(1989) . 

10 (b) Sequencing of TSS1 

A strongly positive clone from the Lambda ZapII library was 
selected and sequenced manually. The sequence obtained included 
an open reading frame that encoded a 58 JcDa protein, but none 
15 of the short chain peptide sequences disclosed in Example 3 was 
found in the amino acid sequence encoded by this ORF. 

Therefore r a second clone was selected, from a group of clones 
that gave distinct restriction maps compared with the group 

20 * including the first clone. It also responded less strongly to 
anti-57K serum, which is why it was not chosen in the first 
place. It was sequenced using the Exonuclease ITI/Mung Bean 
nuclease system for producing series of unidirectional 
deletions. The deletions were prepared according to 

25 Stratagene's manual for the pBluescript Exo/Mung DNA sequencing 
system. The plasmid was first digested with the restriction 
enzymes Sad, which leaves a 3' overhang, and BamHI, which 
leaves a 5' overhang. For filling in possible recessed 
3 'termini created by Mung Bean nuclease, 2.5 /xl of 10X 

30 nick-translation buffer, 1 /xl of dNTP (a mixture of all four 
dNTPs, each at 2 mM) and 1 /xl (2U) of Klenow fragment were 
added. The reaction proceeded for 3 0 min at room temperature 
and was then stopped with 1 (il of 0,5 M EDTA [Sambrook et al. 
(1989) loc. cit.]. The deletion time points were run on a 0.8 % 

35 low melting agarose gel. The bands were cut out, melted and 

ligated according to Stratagene's instructions. Portions (5 /xl) 
of each ligation mixture were used to transform XL-l Blue 



cells - 
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The clone prov d to encode all the short chain peptide 
sequences disclosed in Example 3, exc pt the poorly defined 
5 pentap ptide, 966b. It is notable that the anti-57K serum alone 
was an inadequate tool for cloning this gene: the amino acid 
sequence data disclosed in Example 3 were also essential • 
Comparison of sequences with the Microgenie Data Bank showed 
that the gene sequence of the clone was available as an unknown 

10 reading frame in the post-translational region of the gene for 
yeast (3. cerevisiae) vacuolar HVATPase. The data in the bank 
contain sequence errors, and have thus been erroneously 
interpreted as two short unidentified ORFs instead of one long 
ORF. The complete sequence of the TSS1 gene with 800 bp of 

15 promoter and 200 bp of terminator regions is disclosed as SEQ 
ID NQ:1 and the amino acid sequence deduced from its 'ORF 
(starting at nucleotide 796) as SEQ ID N0:2. SEQ ID N0:l now 
incorporates the following minor corrections to the promoter 
region , made since February 14th 1992: the original nucleotides 

20 60 and 61 (CA) become AC r original nucleotides 646 to 653 

(CGCGTGGT) become GCCGGG and the original nucleotide 711 (C) is 
deleted. Fig 3 A shows the promoter and terminator regions, and 
Fig 3b shows the deduced amino acid sequence. 

25 Example 5 Cloning and sequencing of TSL1 and TSL2 

(a) Preparation and screening of genomic DNA libraries 

The gene TSL1 was first found in the same library as described 
in Example 4. Screening was done using first aftti-TPS/P serum 

30 and then anti-93K serum. I*ater, another library was constructed 
from a partial EcoRl digest of chromosomal DNA from S. 
cerevisiae . strain S288C, using the methods described in 
Example 4. The anti-93K positive clones were classified by 
restriction mapping into groups, not all of which can represent 

35 TSL1. 
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(b) Sequencing of TSL1 

Clones from one group of anti-93K positive clones from the 
Haein library were partially sequenced manually and then 
5 aut matically from pBluescript exonuclease deleti n series as 
described in Example 4. 

The Haelll clones did not contain the whole of this long gene, 
and the N-terminus was not found in any clone* Therefore, the 
10 new EcoRl library was constructed and screened, first with 

anti-93 serum and then with nucleotide probes derived from the 
sequenced parts of TSL1. 

Several anti-93K positive clones, which also hybridized with 
15 the nucleotide probes, were obtained. These contained a plasmid 
with an 8*2 kb insert* From this plasmid a 2 kb fragment was 
cut with restriction enzymes StuI and Seal, religated into the 
pBluescript Smal site and sequenced using exonuclease 
deletions. The deletions were started using the enzymes SacI 
20 and Spel. Sequencing was done with the automatic sequencer . The 
sequence of TSL1 was thus completed* 

The complete sequence is contained in the 8*2 kb insert of the 
EcoRI clones, and has been deposited as plasmid pALK751 on 
25 February 18, 1992 with the Deutsche Sammlung von 

Microorganismen (DSM) , Gesellschaft fur Biotechnologische 
Forschung GmbH, Grisebachstr • 8, 3400 Gottingen, Germany and 
given the accession number DSM 6928. 

30 The sequence is shown as SEQ ID NO: 83. Nucleotides 2282 to 5575 
comprise an ORF that encodes the amino acid sequence SEQ ID NO: 
82. The promoter and terminator regions and amino acid sequence 
are also shown in Fig 4* The amino acid sequence includes the 
amino acid sequences obtained from (fragments of) the long (123 

35 kDa) chain of trehalose synthase disclosed and discussed in 
Example 3. 
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<c) Isolation and sequ ncing of TSL2 

The inf rxnation disclosed ab ut the 99 kDa polypeptide 
(especially in Examples 1 & 3) provides obvious procedur s for 
5 the isolation and characterization of the TSL2 gene by on 
ordinarily skilled in the art* Because the anti-93K serum 
recognizes the 99 kDa polypeptide, anti-93K positive clones 
isolated as described' above can include clones representing 
TSL2. Several positive clones not representing TSL1 were 

10 identified by restriction mapping* One of these was deposited 
on January 28th 1993 as the plasmid pALK756 (see Table 1) with 
the Deutsche Sammlung von Mikroorganismen und Zellkulturen 
GmbH, Mascheroder Weg 1 B, D-3300 Braunschweig, Germany 
(Accession number DSM 7425). This plasmid comprises a 3.5 kb 

15 insert in pBluescript. The insert was not cut by the 

restriction enzymes, Not I, Sad, Spel or Xhol. The sequences 
of these and similar clones can be examined to identify an ORF 
that encodes the amino acid sequences of peptides isolated from 
the 99 kDa polypeptide (viz., SEQ ID NO:S 29 to 38 and 44 to' 

20 49) . Another well established procedure is to use these amino 

acid sequences to design nucleotide primers that can be used to 
amplify parts of the TSL2 gene by the polymerase chain 
reaction. When a part of the TSL2 gene has been isolated and 
sequenced by either procedure, the rest of the gene can be 

25 easily isolated as described for TSL1. 

Example 6. Characterization of TSS1 and TSL1 . 

30 The nucleotide sequence of TSS1 encodes a polypeptide of 495 

amino acid residues with a calculated molecular mass of 56 kDa. 
This open reading frame starts with an ATG codon and ends with 
two TGA codons. The promoter region contains a TATA box at -186 
(see Fig 3) and the sequence CCCCGC at -270, which has been 

35 implicated in catabolite repression [Nehlin & Ronne, (1990) 

European Molecular Biology organization Journal 9, 2891-2898]. 
This may account for the low expression of trehalose synthase 
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in the presence of glucose disclosed in Example 2. 

The open reading frame of TSLl ncodes a polypeptide of 1098 
amino acids , corresponding to a calculated molecular mass of 
5 123 kDa. This ORF starts with an ATG codon and nds with a TAA 
codon. Sixty base pairs downstream from the TAA codon is a 
possible TATATA transcription termination element [Russo et al. 
(1991) European Molecular Biology Organization Journal 10, 563- 
571] . 

10 

The promoter sequence of TSLl contains two putative TATA boxes 
at -100 and -117. The promoter was searched for possible heat 
shock elements and four AAG6GG elements were found (-166, -180, 
-232 and -378). Of these, the one furthest upstream, at -378 , 

15 was part of the sequence GGTAAAAGGGGCGAA , which corresponds 
well to the HAS -360 heat shock stress control element 
GGTAAGGGGCCAA [Marchler, G. et al (1992) Yeast 8 r S154]. Two 
copies of the canonical heat shock element GAANNTTC were found, 
one at -353 and the other at -425; thus, one on either side of 

20 the UAS_ 360 element. 

The sequence GCCCCTGCATTTT at -327 could be a MIG1 protein 
binding site (the consensus sequence is TCCCCRGATTNT) . MIG1 
appears to act as a repressor of transcription in the presence 
25 of glucose [Nehlin, J.0. & Ronne, H. (1990) European Molecular 
Biology Organization Journal 9, 2891-2898; Nehlin, J.0. et al 
(1991) ibid 10, 3373-3378]. These features of the TSLl sequence 
are shown in Fig 4. 

30 The amino acid sequence encoded by TSLl contains two 

polyglutamine tracts, four Qs starting at amino acid 42 and 
five Qs starting at 164. Such glutamine-rich sequences have 
been associated with heteromeric protein-protein interaction 
[Gancedo, J.-M. (1992) European Journal of Biochemistry 206, 

35 297-313]. 



Fig. 5 discloses that the entire TSS1 gene exhibits 37 % 
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identity at the amino acid level to a 502 amino acid stretch 
from th middle of the TSL1 product. The genes are obviously 
clos ly related. 

5 Most surprisingly, the TSS1 gene is identical to the CIFl g ne 
that has been recently cloned and sequenced by Gancedo's group 
[Gonzales et al (1992) Yeast 8 183-192]. This disclosure 
reveals that special methodology is required to handle mutants 
containing modified forms of the TSS1 gene, because cifl 

10 mutants have severe defects in sugar metabolism, as discussed 
in the Detailed Description. It also explains, of course, why 
no recognisable short chain is present in the Klg 102 mutants, 
which carry the cifl mutation (see Example 7) . Previously, it 
has been (tacitly) assumed that failure of cifl and fdpl 

15 mutants to express TPS activity is the consequence of a lengthy 
cascade of regulatory effects. The findings disclosed here and 
in Example 7 show that absence of the short chain of trehalose 
synthase is the primary defect, from which, in an as yet 
completely obscure way, the other regulatory defects of these 

20 mutants result. 

5 . cerevisiae chromosomes were separated by pulsed field 
electrophoresis, with pulse times of 60 sec for 15 h and 90 sec 
for 9 h at 200 volts, as recommended by the instruction manual 

25 for the CHEF-DR II BioRad Laboratories, Richmond, California] . 
Genes were located using digoxigenin- label led non-radioactive 
probes f following the instructions in the manual by Boehringer 
Mannheim. The following probes were used: a 2,1 kb Dral 
restriction fragment from TSL1 and a 1.9 kb Narl-Smal 

30 restriction fragment of TSS1 (the Smal site is in the linker 

between the insert and the vector; important restriction sites 
in TSS1 and TSL1 are shown in Fig 6) . TSS1 was located 
exclusively on Chromosome 2, which is where both FDP1 [Van de 
Poll and Schambert (1977) loc. cit.] and CIFl [Gonzales et al. 

35 (1992) loc. cit.] have been located. This disclosure further 

strengthens the evidence for the identity of TSS1 with CIFl and 
FDP1. By using the GalH gene as a marker for chromosome 16 TSLi 
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was located exclusively on the adjacent Chromosome 13. 
Immediately downstream of TSL1 lies, in pposite orientation , 
the ARGRXX gene, sequenced by Messenguy et. al. [(1986) 
Europ an Journal of Biochemistry 157, 77-81]. The start of the 
overlapping sequence is shown in Fig 4. 

Example 7 A functional TSS1 gene is required for expression of 
both TPS and TPP activities 

The S. cerevisiae mutant Klg 102 , was obtained from Dan 
Fraenkel (Harvard Medical School) and has the genotype MATa, 
ural f leul, trp5, cif 1-102. It was routinely grown on YP/2% 
galactose or YP/2% glucose, and long term storage was under 
liquid nitrogen. As reported by others [Navon, G. , et al. 
(1979) Biochemistry 18, 4487-4499; Banuelos, M. & Fraenkel, 

(1982) Molecular and Cellular Biology 2, 921-929], this 
mutant would not grow on YP/2% fructose, though revertants were 
frequent. 

Six individual colonies from each of two substrains of Klg 102, 
ALKO 2669 and ALKO 2670, that differed in reversion frequency 
and colony size, were streaked onto YP/2% fructose and YP/2% 
glucose at 30 °C After 45 h, all 12 streaks were growing on 
glucose, although slower than the control yeast, X2180, but 
none showed any growth on fructose* After 4 days, five of the 
ALKO 2669 streaks showed several large, but isolated colonies 
on fructose and one ALKO 2670 streak showed several small 
colonies on fructose. From the glucose plates, three streaks 
from each substrain were chosen for the smallest number of 
revertants on the corresponding fructose plate, and used to 
inoculate 100 ml portions of YPD in 250 ml shake flasks, and 
grown at 200 r.p.m. and 30 °C. Three parallel flasks were 
inoculated with X2180. A600 and residual glucose in the media 
were monitored and samples were plated out quantitatively onto 
YP/2% glucose and YP/2% fructose. The ALKO 2669 cultures grew 
faster than the ALKO 2670 cultures, and both grew much slower 
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then X2180 (not shown) . 

At appropriate times the cells were harvested, brok n and 
analyzed as described in the General Materials and Methods • 
Results are shown in Table 5. 



50 

Table 5. Growth of Klcr 102 and X2180 strains on YPD 



10 



The cultures were performed as described in the text. Residual 
glucose and cell mass are given as, respectively, g/100 ml and 
mg/ml of growth medium. Phosphoglucoisomerase (PGI) was 
determined as described in Example 11* PGI f TPS and TPP are 
given as U/g of wet cells (TPS was determined in the presence 
of 5 mM F6P) . Trehalose is given as mg/g of wet cells. 
Viability Fru/Glu shows the number of cells able to 
grow on fructose as a percentage of the number of cells able to 
grow on glucose at the time of harvesting. Cells from the 
cultures 2670/1 and 2670/2 were combined for breakage and 
subsequent analysis. ND r not determined. 



15 



20 



25 



30 



35 



40 



Strain Age Residual Cell Mass PGI 
Glucose 
(h) (g %) 

Klcr 102 cultures 



(mg/ml) (U/G) 



TPS TPP Trehalose Viability 

Fru/Glu 

(U/G) (U/G) (mg/g) (%) 



2669/1 


24 


ND 


4.3 


88 


£0.02 


ND 


ND 


2.4 


2669/2 


48 


£0.02 


11.6 


81 


£0.03 


0.034 


ND 


£1.7 


2669/3 


114 


none 


10.3 


ND 


ND 


£0.02 


£0.22 


£1.8 


2670/1 
2670/2 


110. 
110 


none 


9.7 


89 


£0,03 


0.081 


ND 


1.4 
4.0 


2670/4 


114 


none 


11.2 


ND 


ND 


£0.02 


£0.19 


£0.3 


X2180 cultures 














1 


24 


nd 


19.1 


93 


6.3 


1.7 


ND 


ND 


2 


110 


none 


31.7 


126 


6.3 


2.3 


ND 


ND 


3 


114 


none 


34.4 


ND 


ND 


2.9 


29 .3 


ND 



These results show that TPS activity was below the detection 
level in the Klg 102 samples and less than 0.5 % of the value 
45 in X2180, which is typical of wild type S. cerevisiae . This 

agrees with previously reported results [Paschoalin, V.M.F., et 
al. (1989) Current Genetics 16, 81-87]. Surprisingly, however, 
TPP activities were also very low, betw en £ 1 % and 5 % of the 
X2180 values. Even this residual ability to hydrolyse 
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trehalose-6-phosphate is likely to b due to non-specific 
phosphatases. Paschoalin et al . [(1989) loc* cit.] claim that 
Klg 102 specifically lacks UDPG-linked TPS activity, but that, 
like the wild-type yeast S288C (which is the haploid form of 
X2180) , it contains an ADPG-linked activity. If this w re true, 
and accepting the conventional view that trehalose synthesis in 
yeast proceeds via free trehalose-6-phosphate, Klg 102 should 
contain significant TPP activity. Our results disclose that 
this is not the case. Furthermore, when we tested whether wild 
type yeast (X2180) was able to synthesise [ 14 C] -trehalose from 
[ 14 C]-G6P in the presence of UDPG or ADPG, we found significant 
activity only in the presence of UDPG. The assay systems used 
by Paschoalin et al, [(1989) loc. cit,] have been criticised by 
Vandercammen et al. [(1989) loc. cit.], so we tested thfc 
overall reaction directly. Yeast extracts were incubated in 40 
mM HEPES pH 6.8 containing 1 mg BSA/ml, 10 mM MgCl 2 and 10 mM 
[U U C]-G6P (736 c.p.m./nmol) in the presence or absence of 5 
mM UDPG or 2.5 mM ADPG and presence or absence of 5 mM K 
phosphate. Reactions were stopped by boiling for 2 min and' 
addition of AG1-X8 (formate) anion exchange resin, as in the 
TPP assay system described by Londesborough & Vuorio £(1991) 
loc* cit.]. Results are shown in Fig 7. Without UDPG or ADPG, 
radioactivity appeared in the resin supernatants , presumably 
due to phosphatases active on G6P. UDPG caused a clear increase 
in this rate in the absence of phosphate and a marked increase 
in the presence of 5 mM phosphate, which stimulates the TPP 
activity and inhibits the TPS activity of trehalose synthase ♦ 
With UDPG and 5 mM phosphate, the increase in rate 
corresponded, after a lag phase, to 0.94 /i*ol/min/g of fresh 
yeast, which is about 50 % of the TPP activity of this yeast at 
20 mM phosphate. ADPG, however, did not cause any significant 
increase in the rate of appearance of radioactivity in the 
resin supernatant, indicating that no ADPG-linked TPS activity 
was present. 

Western blots of the homogenates of Klg 102 and X2180 yeast are 
shown in Fig. 8. The origin of the bands marked D is not clear: 



52 



they may be degraded short chain • X2180 shows a strong 57. kDa 
band f due to the short chain of trehalose synthase and s veral 
weak bands at 100 to 130 kDa due to intact and truncated 
versions of the long chain. In contrast , although the Klg 102 
5 samples showed stronger long chain bands, because mor yeast 
sample was applied to the gel, they showed no trace of a short 
chain band. Thus, Klg 102 does not contain a recognisable form 
of the product of the TSSl gene (it might contain a truncated 
version lacking the epitopes recognised by our polyclonal 

10 antibodies) , but contains normal amounts of the TSL1 product. 
Furthermore, the TSL1 product appears to increase as Klg 102 
traverses the diauxic lag (compare e.g. lanes 3 and 2 of Fig. 
8) , suggesting that expression of the long chain of trehalose 
synthase in this yeast increases when all glucose is consumed. 

15 In wild type yeast, increases in both short and long chains 

occur concomitant with the increases in TPS and TPP activities 
when glucose is consumed (Example 2) . 

These results disclose that the failure of Klg 102 to express 
20 * immunologically recognisable short chain of trehalose synthase 
is correlated with the absence of both TPS and TPP activities. 
This unexpected behaviour, in contradiction of the views of 
Paschoalin et al. [(1989) loc. cit.], indicates that a 
functional short chain is required to assemble a trehalose 
25 synthase with either partial activity. 

Similar experiments were done with S» cerevisiae , strain MV6807 
(obtained from Johan Thevelein, Laboratorium voor Moleculaire 
Celbiologie, Instituut voor Plantkunde, Heverlee, Belgium) , 

30 which carries the fdpl mutation, which is allelic to CIF1 and 
TSSl. This strain grew poorly on glucose (fructose was not 
tested) and so was grown on galactose. Stationary phase cells 
contained 6 ± 6 % of normal TPS but about 20 % of normal TPP. 
Western analyses showed the presence of a band at 57 kDa 

35 recognised by anti-57K serum as well as normal long chain 
bands, so the mutation in MV6807 must be an aminoacid 
substitution. Apparently, this substitution causes a greater 
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decrease in TPS activity than TPP activity. 

Example 8/ Biochemical evidence that a long chain of trehalose 
synthase is required for TPP activity 

5 

Truncated trehalose synthase containing the short (57 JcDa) 
chain and the 86 and 93 kDa long chain fragments was prepared 
according to the method of Londesborough & Vuorio (1991) loc. 
cit»] f or proteolytically activated TPS/P complex. TPS and TPP 

10 activities were assayed as described by Londesborough & Vuorio 
[(1991) loc. cit.]. [N~ethyl-l- u C]- maleimide (ethyl-labelled 
NEM; 40 mCi/mmol) was NEC-454 from New England Nuclear ♦ 
N-ethyl- [2 ,3- 14 C] -maleimide (ring-labelled NEM; 6 mCi/mmol) was 
CFA 293 from Amersham International. Both were obtained as 

15 solutions in n-pentane and the manufacturer's stated specific 
activities were assumed to be correct. Unlabelled N-ethyl - 
maleimide (NEM) was E-3876 from Sigma. It was dissolved in 25 
mM HEPES pH 7.0 immediately before use and standardized by 
absorption measurements at 305 nm, assuming an E** of 0*62. 

20 

Treatment of truncated trehalose synthase with 1.9 mM NEM at 24 
a C in the presence of about 0.17 mM dithiothreitol (which 
presumably rapidly consumes about 0.34 mM NEM) caused a rapid 
and essentially complete 98 %) loss of TPP activity, but 
25 little (£ 24 %) loss of TPS activity (Fig. 9) . This suggested 
that NEM modified one or more amino acid (presumably cysteine) 
side chains that are required intact for TPP but not for TPS. 

To permit quantitative experiments with low concentrations of 
30 labelled NEM, the dithiothreitol in the enzyme preparation was 
removed by gel-filtration through Pharmacia NAPS columns 
equilibrated with 1 mg BSA/ml of 25 mM HEPES pH 7.0 containing 
2 mM MgCl 2 , 1 mM EDTA and 0.2 M NaCl. Recoveries of TPS and' TPP 
activities through this gel-filtration were above 85 %. 

35 

In one experim nt, 2.0 /il of 2,4 mM ethyl-labelled NEM was 
mix d with 150 til of gel-filtered enzyme and incubat d at 23 
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°C. Samples (10 /xl) taken at various times up to 190 min were 
mixed with 60 pi of Laemmli sample buffer (the mercapto-ethanol 
in this buffer should destroy residual NEM) , boiled for 5 min 
and subjected to SDS-FAGE* At closely similar times (and also 
at 23 h) other samples (10 nl) were mixed with 100 pi (for TPS) 
or 700 Ml (for TPP) of 5 mg BSA/ml 25 mM HEPES pH 7.0 
containing 2 mM MgCl 2 , 1 mM EDTA, 0.2 M NaCl and 1 mM 
dithiothreitol (the dithiothreitol should destroy residual NEM) 
and assayed for TPS and TPP* The enzyme dilution used for the 
TPP assay was sufficient that radioactivity from the NEM (about 
1/3 of which remains in the resin supernatant) did not 
interfere with the TPP determinations. 

After electrophoresis, the upper (cathode) buffer, containing 
most of the added radioactivity, was completely removed before 
disassembling the apparatus. The gel was then fixed, stained 
and destained as described by Laemmli [(1970) Nature, Londofi 
227, 680-685] and dried. An autoradiogram of this gel (Fig. 10) 
showed that the 93 kDa band (and also BSA) became labelled 
during the experiment, while the 86 and 57 kDa bands were much 
more weakly labelled. The Coomassie blue stained bands and 
adjacent, empty areas (as blanks) were cut out of the dried gel 
(in later experiments, they were cut from undried gels), broken 
up and extracted overnight with 1 ml of 5 % SDS * in pre-blanked 
scintillation vials. Then 10 ml of a toluene/ Triton XlOO-based 
scintillant was added, and the tubes were repeatedly counted 
using a wide energy window to minimise quench effects. After 10 
h constant counting levels were reached* Excess radioactivity 
was calculated by subtracting a blank value obtained from empty 
regions of the gel. Results are shown in Fig* 11. In control 
experiments, in which enzyme was omitted, it was shown that the 
excess radioactivity found in the 93 and 86 kDa bands did not 
originate from potential labelling of impurities in -the BSA. 

Fig. 11 shows that label from NEM enters mainly the 93 kDa 
fragment of the long chain, with relatively small amounts 
entering the 86 kDa fragment and the 57 kDa short chain. Also, 



55 



the amount of label entering the long chain fragments (93 + 86 
kDa) is roughly proportional to the loss of TPP activity, but 
lags increasingly behind this loss: at 10*5 min 30 % of the 
initial TPP was lost and 0.20 moles of NEM had entered the long 
chain fragments p r mole (150 Kg) of enzyme, whereas at 190 
min, 56 % of TPP was lost and 0.32 moles of NEM had entered the 
long chain fragments. Possibly, since trehalose synthase may be 
an octamer (its native molecular mass is about 800 kDa), 
reaction of one long chain with NEM can eventually lead to loss 
of activity associated with the other long chains in the 
octamer. Fig. 12 collates data from several experiments, using 
both ring- and ethyl-labelled NEM. Parallel experiments with' 
identical concentrations of ring- and ethyl-labelled NEM 
suggested that about 25 % of the radioactivity from ethyl- 
labelled NEM originally fixed* in the protein was lost during 
SDS-PAGE processing (some loss is expected in acidic? 
condition) , and the results with ethyl-labelled NEM have been 
corrected accordingly. Within the limits of accuracy (a 
specific activity of 30 TPS units /mg was used to calculate the 
mass of protein and a dimer molecular mass of 150 kDa was 
assumed for the truncated enzyme) complete loss of TPP 
reflected incorporation of rather less than 1 mole of NEM into, 
specifically, the long chain fragments. 

Another reagent with high specificity for cysteine, 
dithiodinitro-benzoate (DTNB) , also caused a specific loss of 
TPP activity: after 10 min treatment with 0,9 mM DTNB over 95 % 
of the TPP was lost and less than 28 % of the TPS. 

These findings disclose that TPP activity requires a long chain 
with a proper structure, because modification of a single amino 
acid (presumable cysteine) residue in the 93 kDa fragment 
eliminates TPP but not TPS activity. Sequencing data given in 
Example 3 disclosed that the 93 kDa band contained material 
from both the 99 kDa and 123 kDa long chains. Thus, the present 
results disclose that either the 99 kDa or the 123 kDa or both 
long chains are involved in TPP activity. 
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Example 9. An isolated 99 kDa polypeptide from trehalose 
synthase contains TPP activity 



Because the long and short chains of trehalos synthase were < 
5 dificult to separate by usual chromatographic procedures, 

fractionations were attempted in the presence of a non-ionic s 
detergent* During fractionation with a NaCl gradient on 
DEAE-cellulose (Whatman DE52) in 1 % Triton X100 at pH 8.0, the 
enzyme was recovered in about 90 % yield at 140 mM NaCl. Some 

10 minor polypeptides (e.g. the weak 68 kDa polypeptides visible 
in Fig 1) were removed, but the main 57 , 99 and 123 kDa 
polypeptides were not resolved. However, the ratio of the 99 
and 123 kDa bands changed from about 1.5 to 0.3 across the 
enzyme peak, while concomitantly the TPP/TPS ratio decreased 

15 steadily from 0.54 to 0.42 (data not shown). This suggested 

that the procedure was partially resolving trehalose synthase 
molecules enriched in the 99 kDa polypeptide from those 
enriched in the 123 kDa polypeptide and that the former had a 
relatively higher TPP activity. By extrapolation it can be. 

20 calculated that the TPP/TPS ratio of (hypothetical) enzyme 
containing only 57 and 99 kDa chains would be 0.65 ± 0.10, 
whereas that of enzyme with only 57 and 123 kDa chains would be 
0.32 ± 0.10. 

25 Because the long chain appears to contain an avid phosphate 
binding site (see Examples 10 and 12) , chromatography on 
phosphocellulose was attempted. Native trehalose synthase (4.2 
TPS units) was transferred above a PM10 membrane in an Amicon 
cell to 25 mM HEPES pH 7-0 containing 2 mM MgC12, 1 mM EDTA, 1 

30 mM dithiothreitol and 0.3 % Triton X100 (HMED/0.3 %T) and 

applied to a 0.7 x 4.2 cm column of phosphocellulose (Whatman 1 
Pll-cellulose) equilibrated with the same buffer. The column 
was washed with 4 ml of HMED/0.3 %T and. developed with a linear 3 
gradient from zero to 0.6 M NaCl in 60 ml of HMED/0.3 %T at 5 

35 ml/h. By 0.35 M NaCl only traces of TPS had been eluted (£ 3 % 
in the first 9 ml and < 9 % spread between 0.15 and 0.35 M 
NaCl) . The gradient was interrupted and the column was washed 
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sequentially with (a) 8 ml of 10 mM fructose-6-phosphate in 
HMED/0.3 %T/0.35 M NaCl, (b) 6 ml of HMED/0-3 %T/0.6 M NaCl and 
(c) 0.2 M K phosphate pH 7.0/2 mM MgCl 2 /l mM EDTA/1 mM 
dithiothreitol. No TPS or TPP activity was recovered except in 
a single 1.5 ml fraction in which the 0.6 M NaCl began to 
elute. This contained 12 % of the applied TPP, but £0.1 % of 
the applied TPS. 

Fractions were examined by SDS-PAGE (Fig. 13), which showed: 

(1) almost pure short chain eluted at and just before the start 
of the NaCl gradient in fractions devoid of enzyme activity; 

(2) traces of short and long chain eluted diffusely at about 
0.2 to 0.35 M NaCl in fractions containing altogether £ 7 % of 
the applied TPS activity; (3) at least 50 % and possibly all of 
the applied 99 kDa polypeptide eluted at 0.6 M NaCl in the 
fraction containing 12 % of the applied TPP activity; and (4) 
most of the 123 kDa polypeptide remained bound to the column. 

Intact trehalose synthase has also been fractionated on 
phosphocellulose in the absence of Triton, and with elution by 
a simple linear gradient from 0 to 0.6 M NaCl. Pure or nearly 
pure 99 kDa polypeptide eluted' at about 0.45 M NaCl and 
contained specific TPP activity ( 14 C-G6P was not hydrolyzed) . 
This activity differed from the TPP activity of intact 
trehalose synthase in that the ratio of activities at 25 mM 
phosphate and 50 mM Hepes was between 1.5 and 3 in different 
experiments (cf , this ratio is 5 to 6 for intact trehalose 
synthase) . Furthermore, during storage of the isolated 99 kDa 
polypeptide at 0 °C, the TPP activity at 25 mM phosphate 
decreased and that at 50 mM Hepes increased r until the ratio 
was about 0.7 after 7 weeks. 

These findings disclose that the 99 kDa polypeptide isolated 
from intact trehalose synthase is a specific trehalose-6-r 
phosphatase, but that its catalytic properties are unstable 
and differ from the TPP activity of intact trehalose synthase. 
Tog ther with the disclosure in Example 7 that y ast requires a 
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properly functional TSS1 g ne to exhibit TPP activity , the 
results suggest that proper folding of the 99 kDa polypeptide 
requires th presence of the 57 kDa chain* 

5 Thes findings also disclose that when th short chain is 
separated from the long chain by chromatography in a buffer 
containing 0.3 % Triton, in which intact trehalose synthase is 
stable, it rapidly looses any TPP or TPS activity it possessed 
when correctly folded in the trehalose synthase. 

10 

The findings also indicate that the full-length long chain has 
extraordinarily high affinity for phosphocellulose, which is 
consistent with the location of a high affinity phosphate 
binding site in a terminal portion of this chain as suggested 
15 by Examples 10 and 12. 

Example 10 Truncation of the 123 kDa long chain of trehalose 
synthase bv trypsin in vitro dramatically increases TPS 
activity 

20 

Removal, of the N-terminal 325 or so amino acids from the 123 
kDa long chain of intact trehalose synthase by treatment with 
trypsin in vitro produces an enzyme with catalytic properties 
like those of the truncated enzyme purified by Londesborough & 

25 Vuorio [(1991) loc. cit« 1 . In one experiment intact trehalose 
synthase (0.28 TPS units, « 9.4 fig) was incubated with or 
without 0.5 fig of trypsin at 30 °C in 250 /il of 13 mM HEPES pH 
7.0 containing 1 mM MgCl 2 , 0.5 mM EDTA, 0.5 mM dithiothreitol, 
0.2 M NaCl and 0.5 mM benzamidine. Its TPS activity was 

30 determined at intervals using standard assay mixtures 

(containing 5 mM F6P) containing no or 4 mM K phosphate pH 6.8, 
and samples were prepared for SDS-PAGE analysis immediately 
before and 48 min after addition of the trypsin. 

35 During the first 48 min the TPS activity measured in the 
absence of phosphate decreased faster in the presence of 
trypsin than in its absence. However, in the first 10 min, 
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trypsin caused a 4-fold increase in the activity measur d at 4 
mM phosphate, and by 48 min the activities with and without 
phosphate w re essentially equal (Fig. 14) . By 48 min, the 123 

<* JcDa full length long chain had disappeared and been replaced by 

5 a doublet of polypeptides at 85 kDa (Fig. 15) . In contrast, the 

v short chain (57 kDa) was unchanged and the 99 kDa band was only 

slightly decreased in strength. The changes in TPS activity 
were accompanied by loss of about 50 % of the TPP activity. 

10 Which part of the 123 kDa chain was removed by trypsin was 

determined as follows. Intact trehalose synthase (180 fig) was 
transferred to 0.5 ml of 25 mM HEPES pH 7.0 containing 2 mM 
MgCl 2 , 1 mM EDTA, 1 mM dithiothreitol and 0,2 M NaCl using a 
Centricon 30 tube, and then treated with 11 nq trypsin at 25°C. 

15 The standard TPS activity did not decrease during the trypsin 
treatment, whereas TPS activity measured in the absence of F6P 
and presence of 10 mM phosphate increased from 26.% to 73 %' of 
the standard activity during the first 30 min of treatment. 
After 68 min treatment, when SDS-PAGE analysis showed the 

20 complete disappearance of the 123 and 99 kDa bands and 

appearance of a doublet with apparent molecular mass about 85 
kDa (the components differing by about 1*5 JcDa) , the mixture 
was centrifuged through a Centricon 30 tube to separate the 
tryptic peptides from the core enzyme. The retentate was then 

25 boiled in 0.5 % SDS and again centrifuged through the Centricon 
30 tube. The combined filtrates were diluted to 0.1 % SDS and 
incubated for 18 h at 25 °C with 4 % by weight of 
endoproteinase Glu-C (Boehringer) . The peptides were then 
separated by HPLC using a DEAE pre-column and sequenced as 

30 described in Example 3. 

Twenty sequences were obtained (Seq ID NOs 58 to 77 in Table 
6) . Fifteen of these were found in the N-terminal 325 amino 
acids coded by TSL1. One (peptide 1407, recovered at less than 
half the yield of the others) was amino acids 1089 - 1093, 
i.e., 5 amino acids from the C-terminus of the protein coded by 
TSL1. This peptide is presumably derived by endoproteinase Glu- 
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c cleavage of the tryptic peptide starting after Lys 1079. Both 
the 86 and 93 kDa long chain fragments in the truncated 
trehalose synthase purified by Londesborough & Vuorio [(1991) 
loc. cit. ] are disclosed in Example 3 to contain a p ptide 
5 (1483b & 1484b) derived from Alal064 to Lysl079, confirming 
that the truncated polypeptides extend at least this close to 
the C-terminus of the full length 123 JcDa chain. The N-terminal 
peptide furthest from the N-terminus was peptide 1443 , obtained 
by cleavage after Arg 335. Thus, the truncated long chain 

10 extends from Ser 336 to Lys 1079 or Asp 1098, and is predicted 
to have a molecular mass of 87.3 or 86.2 kDa. The SDS-PAGE 
analysis of tryps in-treated enzyme suggests both of these 
truncated chains are formed, and because the TPS activity in 
the presence of F6P changes little during the trypsin 

15 treatment, the two truncated chains probably have similar 
activities .* 

Of the remaining four peptides in Table 5, two (1419b and 
1437b) are still unidentified, but may originate from the 99 
20 kDa polypeptide, whereas two (1442 and 1451) clearly originate 
from that polypeptide. Thus, peptide 1442 is identical to 
peptide 1307a of Table 4, and the first 5 amino acids of 
peptide 1451 are identical to peptide 1297a (Table 4). 



25 These results disclose that removal of the N-terminal 325 amino 
acids of the long chain, with or without removal of the O 
terminal 19 amino acids, results in a trehalose synthase that 
is relatively insensitive to inhibition by phosphate, and does 
not require F6P for full activity. Analysis of the secondary 

30 structure of the long chain according to Garnier et al [(1978) 
Journal of Molecular Biology, 120, 97-120] suggests that 
whereas the C-terminal 700 amino acids are likely to be in 
alpha-helices or beta-sheets, the N-terminal 360 amino acid 
portion of the protein is relatively. devoid of such structures. 

35 Taken together, these data suggest that the N-terminal 330 or 
so amino acids comprise a distinct domain, that conf rs 
regulatory properties upon the TPS activity of trehalose 
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synthase, including sensitivity to inhibition by phosphate and 
a requirement for F6P to expr ss full catalytic activity. Thus, 
the TSL1 gene product must also be involved in TPS activity. 

Table 6 Peptides r leased from intact trehalose synthase during 
activation bv limited treatment with trypsin. 

When two sequences were obtained from the same RPLC peak, they 
are shown as a and b sequences, assigned according to the 
sequences predicted from the TSL1 gene. Tentative 
identifications from the amino acid sequencer are shown by one 
letter codes and double queries; unidentified residues Xaa. (In 
the Sequence Listings also tentative identifications are 
indicated as Xaa) . The location of each amino acid sequence in 
the long (123 JcDa) chain of trehalose synthase in Fig 4b is 
shown below the sequence. 

1400 Leu-Leu-Val-His-Ser-Leu-Leu-Asn-Asn-Thr-Ser-Gln-Thr-Ser- 
Leu-Glu-Gly-Pro-Asn 

(SEQ ID NO: 58) (181-200) 

1401 Ser-Ser-Thr *Thr-Asn-Thr-Ala-Thr-Leu-Xaa-Xaa-Leu-Val- Ser *- 
Ser-Xaa-Ile-Phe-Met-Glu 

(SEQ ID NO:59) (84-104) 

1406 Ala-G??-Asn-Arg-Pro-Thr-Ser-Ala-Ala-Thr-Ser-Leu-Val-Asn- 
Arg 

(SEQ ID NO: 60) (210-24) 

1407 Xaa-Phe-Thr-Ile-Ile-S?7 
(SEQ ID NO: 61) (1088-93) 

1408 Asn-Leu-Thr-Ala-Asn-Ala-Thr-Thr-Ser-His-Thr-Pro^Thr-Ser- 
Lys 

(SEQ ID NO: 62) (105-19) 
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1409 Ph -G??-G??-Tyr-Ser-Asn-Lys 
(SEQ ID NO: 63) (319-25) 

14 16 S? ? -Pro-S? ? -Ala-Phe-Asn-R?? 
5 (SEQ ID NO: 64) (77-83) 

1417a Ile-Ala-Ser-Pro-Ile-Gln-T??-Glu 
(SEQ ID NO: 65) (145-52) 

10 1417b Gln-Arg-Pro-Leu-Leu-Ala-Lys 
(SEQ ID NO: 66) (257-63) 

1418 Phe-Phe-Ser-Pro-Ser-Ser-Asn-Ile-Pro-Thr-Asp-Arg 
(SEQ ID NO: 67) (133-44) 

15 

1419a Ala-Leu-Ser-Asn-Asn-Ile-Ser-Gln-Glu 
(SEQ ID NO: 68) (47-55) 

1419b A??-L??-S??-Tyr-Thr-Pro 
20 (SEQ ID NO: 69) (not found) 

1420 Ile-Ala-Ser-Pro-Ile-Gln-Gln-Gln-Gln-Gln-Asp-Pro-Thr-Ala- 
Asn-Leu 

(SEQ ID NO: 70) (159-74) 

25 

1437a Thr-Met-Leu-Lys-Pro-Arg 
(SEQ ID NO: 71) (120-25) 

1437b Ile-Ile-Glu-Asp-Glu-Ala 
30 (SEQ ID NO: 72) ((not found) 

1438 Ile-Thr-Pro-His-Leu-Thr-Ala-Ser-Ala-Ala-Lys 
(SEQ ID NO: 73) (246-56) 



35 1439 



Ser-Leu-Val-Ala-Pro-Ala-Pro-Glu 
(SEQ ID NO: 74) (56-63) 
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1442 Lys-Pro~Gln-Asp^eu-Asp-Asp-Asp-Pro-Leu-Tyr-Leu 
(SEQ ID NO: 75) (fr m 99 JcDa) 

1443 Lys-Tyr-Ala-Leu-Leu-Arg 
(SEQ ID NO:76) (330-35) 

1451 Gln-Leu-Gly-Asn-Tyr-G??-Phe-Tyr-Pro-Val-Tyr 
(SEQ ID NO: 77) ((from 99 kDa) 



Example 11 Identification of the TPS activator as 
phosphoaj-ucoisomerase 

TPS activator was transferred to 0,1 M Tris/HCl pH 9.0 above a 
PM10 membrane in an Amicon cell. A 300 /il sample (34 /xg) was 
digested for 20 h at 37 °C by 0.8 ng of lysylendo-peptidase Q 
(Wako) . Peptides were separated by HPLC and sequenced as 
described in Example 3. All five sequences obtained and 
disclosed in Table 7 are identical to sequences found in yeast 
phosphogluco isomer ase (PGI) . 
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Table 7. Peptide sequences from TPS activator 



5 



10 



The PGI sequences are from Tekamp-Olson, P., et al. (1988) Gene 
73, 153-161. 

TPS-Activator Peptide PGI Residues 

TA1156 XFTNYDGSK 51-59 
(SEQ ID NO:39) 

TA1158 TGNDPSHIAK 241 - 251 

(SEQ ID NO: 40) 



TA1159 IYESQGK 24-30 
15 (SEQ ID NO: 41) 

TA1160 AEGATGGLVPHK 456 - 467 

(SEQ ID NO: 42) 

20 TA1161 LATELPAXSK 11 - 19 

(SEQ ID NO: 43) 



25 The PGI activity of a sample of TPS activator that had been 
stored for several months at 0 °C was measured in 50 mM 
HEPES/KOH pH 7.0, 5 mM MgCl^ 5 mM F6P and 0.4 mg/ml NADP. A 
specific activity of 190 U/mg was found. 

30 These findings disclose that TPS activator from S» cerevisiae 
is identical to PGI. Example 12 discloses that F6P is a 
powerful activator of the TPS activity of intact r but not of 
truncated, trehalose synthase. Because the assay mixtures for 
TPS contain G6P, it is clear that TPS activator can activate 

35 TPS by producing F6P from the substrate G6P. This is a complete 
explanation for the activation. Thus, at initial concentrations 
of 6.7 mM G6P and 1.9 mM F6P (i.e. G6P/F6P « 3.5, the 
experimental equilibrium ratio) the rate was independent of TPS 
activator and equal to that at 9 mM G6P with TPS activator. 

40 * Previous investigations [Londesborough $ vuorio (1991) loc. 
cit. ] had to use crude preparations of intact trehalose 
synthase because pure intact trehalose synthase was not 
available. Although the effectiveness of TPS activator 
preparations was reported to vary between different enzyme 
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preparations, under certain circumstances data were obtained 
that sugg sted TPS activator might int ract stoichiometrically 
with native tr halose synthase [Londesborough & Vuorio (1991) 

s loc. cit.]. The pr sent findings show that this suggestion was 

5 compl tely incorr ct. The findings also imply that kin tic data 

e in the literature are confused, because some preparations of 

so-called "trehalose-6-phosphate synthase" will have contained 
PGI whereas some may not. With the former preparations, the 
activator F6P will have been generated from the substrate G6P, 
10 but the amount so generated will have depended upon the details 
of the experimental procedure used. 

Example 12. The different kinetic behaviours of intact and 
truncated trehalose synthase 

15 

Truncated trehalose synthase was prepared as described by 
Londesborough & Vuorio [(1991) loc. cit.] and contained the 57 
kDa short chain and 86 and 93 kDa fragments of the long, chain. 
Intact trehalose, synthase was prepared as in Example 1. Kinetic 
20 • assays were done at 30 °C as described in General Methods and 
Materials. 



o 
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(sl) The TPS Partial Activity 

Table 8, Inhibition of the TPS activities of intact and 
truncated enzvme bv phosphate at 5 mM F6P 

5 

The effect of adding K phosphate pH 6.8 to standard assay 
mixtures (10 mM G6P, 5 mM UDPG and 5 mM F6P) is shown. For each 
enzyme , the activity without phosphate is set at 100 %. 

10 Added Phosphate Intact Enzvme Truncated Enzyme 

None 100 % 100 % 

1.3 mM 69 % 94 % 

4.0 mM 14 % 83 % 

15 



The TPS activity of intact enzyme was much more sensitive to 
inhibition by phosphate than was that of the truncated enzyme 

20 (Table 8)* The results in Table 8 underestimate the difference 
between the phosphate responses of intact and truncated enzyme, 
because F6P partially reverses the phosphate inhibition of 
intact enzyme (see below) but has virtually no effect on 
truncated enzyme. Table 9 shows the effect of shifting from the 

25 salt conditions of the standard assay (40 mM HEPES/KOH pH 6.8, 
10 mM MgCl 2 ) to conditions closer to those of yeast cytosol. in 
the absence of F6P, the shift caused 67 % inhibition of intact 
enyzme (from 43 % to 14 % of the standard activity) but only 10 
% inhibition of truncated enzyme (from 96% to 86%)* 

30 

Table 9. Effect on the TPS activity of intact and truncated 
enzyme of shifting to more physiological salt conditions 

For measurements at "physiological conditions " , 1.3 mM K 
35 phosphate and 0.1 M KC1 were added to the standard assay 
mixtures and the MgCl 2 was decreased from 10 to 2.5 mM. 
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Standard Cond. 

(5 mM F6P) No ?€? 



Physiological Cond, 

5 mtt $Q m> 



Intact 100 % 
Truncated 100 % 



43 % 
96 % 



72 % 
90 % 



14 % 
86 % 



These results disclose the insensitivity of the TPS activity of 
truncated trehalose synthase to physiological phosphate 
concentrations and the presence or absence of F6P at a 
concentration well above the normal value in yeast cytosol 
(between 0.1 and 1 mM; Lagunas, R. & Gancedo, C. (1983) 
European Journal of Biochemistry 137 , 479-483), 

Fig. 16 illustrates the F6P-dependence of the TPS activity of 
intact enzyme at different phosphate concentrations. Double- 
reciprocal plots of these data (not shown) indicate that at 1.3 
mM phosphate, and perhaps at 4 mM phosphate, sufficiently high 
concentrations of F6P completely overcome the inhibition by 
phosphate. With no added phosphate, F6P caused a maximum 
activation of 2.5-fold, with a of 60 #M. At 1.3 mM 
phosphate, the maximum activation was at least 20-fold, and the 

was 1.4 mM F6P. The slopes * of these double-recipropal plots 
varied linearly with the square of the phosphate concentration, 
suggesting that : two phosphate binding sites are involved. At 4 
mM phosphate, which is still within the probable range of 
phosphate concentrations in yeast cytosol [Lagunas & Gancedo 
(1983) loc. cit], inhibition was so severe that even 10 mM F6P 
permitted only 40 % of the activity observed under standard 
conditions. Thus, expression of a truncated trehalose synthase 
in yeast would be' expected to cause a large increase in the 
intracellular specific activity of the enzyme. 

Fructose-l-phosphate , f ructose-i , 6-bisphosphate , f ructose-2 , 6- 
bisphosphate and glucose-l-phosphate were tested at sub-optimal 
F6P concentrations (1 mM F6P at 1.3 mM phosphate). None caused 
activation at 5 or 2.5 mM concentrations; instead inhibitions 
of about 25 % occurred, probably due to competition with G6P 
and F6P. 
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fb) The TPP Partial Activity, 

At phosphate concentrations equal to r less than 1 mM, the 
progress curves of TPP reactions catalysed by truncated s 
5 trehalose synthase accelerat d mark dly over at least the first 

10 min of reaction. This did not happen with intact enzyme. For i 
the initial rates of reaction, intact enzyme was activated by 
smaller phosphate concentrations than was truncated enzyme 
(Fig. 17) . For truncated enzyme, double-reciprocal plots of the 

10 activation (va = the rate with phosphate, v pff minus the rate 
without phosphate, v 0 ) were linear when 1/va was plotted 
against 1/ [phosphate] , with a of 3 mM phosphate. For intact 
enzyme these plots were non-linear, and linear plots resulted 
when 1/ [phosphate] 2 was used (Fig. 18). This, again, suggests 

15 that intact enzyme has two strong phosphate binding sites, one 
of which is lost in the truncated enzyme. For intact enzyme, 
half maximal activation was obtained at 0.6 mM phosphate. 

In the absence of phosphate, F6P did not affect the TPP 

20 activity of intact enzyme. At sub-optimal phosphate 

concentrations, 5 mM F6P caused modest (20 to 30 %) inhibitions 
of the TPP activity of both intact and truncated enzymes, and 
at saturating phosphate concentrations, smaller inhibitions (10 
to 15 %) were observed, (data not shown) . 

25 . 

These findings disclose a profound sensitivity of the TPS 
activity of intact trehalose synthase to physiological 
phosphate and F6P concentrations that is lost by truncation of 
the 123 kDa long chain to about 85 kDa. The effects of 

30 truncation are less marked on the TPP activities , both enzymes 
being activated by physiological phosphate concentrations, and 
neither showing a strong response to F6P. The data suggest that 
intact enzyme has two strong phosphate binding sites, one of * 
which is located in the region of the 123 kDa long chain 

35 removed by truncation. The finding that the 123 kDa long chain 
could not be recovered from phosphocellulose, disclosed in 
Example 9 supports this conclusion. 
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Example 13 Expression of TPS activity in Escherichia coli cells 
transformed with TSS1 and TSL1 

E.coli , strain HB101 (ALKO 683) was transformed with the 
plasmids pALK752 and pALK754 consisting of pBluescript 
containing TSS1 and TSL1, respectively (see Example 14 and Figs 
20a and 20c) . Transf ormants ALK03566 and ALK03568 containing, 
respectively, pALK752 and pALK754 were selected and maintained 
by growth in the presence of 50 fig /ml of ampicillin. Shake 
flasks containing Luria Broth with no ampicillin (ALKO 683) or 
15 /xg/ml ampicillin (AIiK03566 and ALK03568) were inoculated 
with 1 ml of a suspension (A600 = 1.5) of the appropriate cells 
and shaken at 250 rpm and 30 °C for 15 h. Cells were harvested 
(5 min and 3000g) , washed twice with water, suspended (1.5 g 
cells/ 3. 7 ml) in - HBMED containing 1 mM PMSF and 10 tig /ml 
pepstatin A, and broken by two passes through a French press 
(Aminco) at 15 000 psi. Samples of the homogenates were 
centrifuged 20 min at 28 OOOg. Homogenates and supernatant s 
were assayed for TPS and TPP at once and the protein contents 
of the supernatants were determined (Table 10) . 

Table 10 . Expression of TPS activity in E. coli transformed with 
TSSl and TSL1. 

Host (ALKO 683) and transf ormants (ALK03566 containing TSSl and 
ALK03568 containing TSL1) were grown, harvested and broken as 
described in the text. Cell homogenates and supernatants were 
assayed at once for TPS, using the standard assay and a blank 
assay from which G6P and F6P were omitted, and for TPP. 
Activities are shown as mU/g fresh cells unless stated 
otherwise. 
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Cell yield (g/200 ml) 

Homoaenates 
Standard TPS 
TPS Blank 
Net TPS 



ALKO 683 
1.57 



361 ± 75 
363 + 57 
0 ± 20 



ALKQ3566 
1.51 



1065 ± 118 
227 ± 45 
840 ± 70 



ALKQ3568 
1.56 



260 ± 23 
117 ± 77 
140 ± 50 



Standard TPP 



1130 ± 70 



1110 ± 100 1190 ± 80 



Supernatants 

Standard TPS 273 ± 73 699 ± 47 233 ± 68 

TPS Blank 263 ± 21 155 ± 42 135 + 9 

Net TPS 10 ± 50 540 ± 10 100 ± 60 

Net TPS (mU/mg protein) 0.08 4.50 0.87 

Standard TPP 1130 ± 100 910 ± 90 1020 ± 80 



Standard and blank TPS assays both showed accelerating progress 
curves and results in Table 10 are mean ± range of 5 min and 10 
min assays, which were handled separately to calculate* the net 
TPS activity. Essentially all of the standard TPS activity 
measured in the host cells and about half of that in ALK03568 
cells was due to a blank reaction (presumably a- phospho- 
diesterase) generating UDP from UDPG in the absence of G6P and 
F6P. The net TPS activity of host cells grown under these 
conditions was close to zero, whereas cells transformed with 
TSS1 or TSL1 contained 840 or 140 mU/g fresh cells, most of 
which (64 % and 71 %, respectively) was soluble. Compared to 
the host preparation, the specific activities of the net TPS in 
the 28 000 g supernatants were increased about 50 -fold 
(ALK03566) and 10-fold (ALK03568) . There are probably two 
reasons for the very low TPS activity of the host cells: 
trehalose-6-phosphate synthase of E. colj . is induced by high 
osmotic strength, and although some strains also acquire 
activity in stationary phase, the enzyme activity itself is 
strongly activated by higher (0.25 M) cation concentrations 
than in our assay conditions [Giaever et al (1988) Journal of 
Bacteriology 170, 2841-2849]. 
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No significant change in the TPP activities was observed* Host 
cells already contained 1100 mU/g of TPP measured in 25 mM 
phosphate (and mor than 5 U/g measured in 25 mM Hepes buffer) . 
If transformation with the plasmids would have generated TPP 
5 activity with a TPP/TPS ratio the sam as in pure trehalose 

synthase from yeast, then the increments in TPP (about 250 and 
40 mU/g for ALK03566 and ALK03568, repectively) would have been 
undetectable for ALK03568 and close to the experimental error 
for ALK03566. 

Western analyses (Fig. 19) showed that ALK03566 specifically 
expressed a 57 kDa band recognized by anti~57K serum and more 
weakly reacting bands with smaller molecular masses. ALK03568 
specifically expressed bands recognized by anti-93K serum at 
15 about 60, 3 6 and 35 kDa (strong), suggesting that extensive 

degradation of the long chain occurs in ALK03568 or "that TSL1 . 
is not correctly transcribed and translated* 

These results disclose (l) TPS activity can be transferred to 
20 heterologous cells by either TSS1 or TSL1, (2) a TSS1 gene 
product has TPS activity and (3) also one' or more (degraded) 
products of TSL1 has TPS activity. This latter finding is 
unexpected, because yeast containing a defective (Example 7) or 
disrupted (Example 14) TSS1 gene lack TPS activity. Possibly 
25 ALK03568 accumulates fortuitously degraded proteolytic products 
of the 123 kDa long chain of trehalose synthase that exhibit 
TPS activity even in the absence of the TSS1 product. 

Obviously, transformation with TSS1 (or TSL1) alone can be used 
30 to introduce a trehalose synthetic pathway to an organism, such 
as E. coli HB101, that already has the capacity to generate 
trehalose from trehalose-6-phosphate, possibly via a non- 
specific phosphatase. 
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Example 14. Trans formation of Yeast 



(1) Assembly of complete genes and truncated versions of TSL1. 

3 

5 Plasmids comprising the complete ORFs of TSS1 and TSL1 and a 

truncated ORF of TSL1 were assembled from appropriate i 
immunopositive clones of the HaelXI and EcoRI libraries used in 
Examples 4 and 5 to sequence these genes: 

10 (a) The TSS1 gene with its promoter (pALK 752) 

A 516 bp fragment was cut from Haelll clone 7 with restriction 
enzymes DraX and BstEII (see Fig 6 for restriction sites). The 
DraX site marks the beginning of the disclosed TSS1 sequence. 
15 This fragment was joined to HaeXXX clone 20 after this had been 
digested with BstEII and Clal (the Clal site was in the 
polylinker) and the Clal end filled with Klenow fragment* The 
sequence at the junction at the BstEII site in the religated 
plasmid (shown in Fig 20a) was confirmed by sequencing. 

20 

fb\ The TSS1 gene without its promoter (pALK75m 

Haelll clone 21 was cut with the restriction enzyme Tthllll. To 
this site the following linker (SEQ ID NO: 84, synthesized with 
25 the ABI DNA Synthesizer) was added: 

5 ' -CGGGAAGACA TAGAACTATG ACTACGGATA ACGCTAAGGC GCAACTGACC -3' 
3 ' -GCCCTTCTGT ATCTTGATAC TGATGCCTAT TGCGATTCCG CGTTGACTGG -5' 

This includes nucleotides -13 to +33 of TSS1 (see Fig. 4) but, 
30 when correctly orientated, introduces a Smal site at nucleotide 
-16 from the ATG start site. The plasmid (shown in Fig 20b) can 
be used to release with Smal the ORF of the TSS1 gene and about 
200 bp of its terminator for further constructions (e.g. 
expression vectors containing a new promoter) . 

35 
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(c) The TSL1 gene with its Promoter fpALK754) 



10 



EcoRI clone 10 was cut with the restriction enzymes Mlul and 
Ndel, and the resulting 4.4 kb fragment was religat d into the 
pBluescript Smal site. This procedur d stroyed all these 
sites, so that these restriction enzymes cannot be used in 
further manipulations. The plasmid is shown in Fig 20c. 

(d) The TSL1 gene without its promoter fpALK7571 



Primers for the polymerase chain reaction (PGR) were made 
against the beginning of the TSL1 gene and the sequence at 
+318. PGR (Techne PHC-2 Heat/Cool Dri-Block R ) was used to 
synthesize (at 55 °C) a 325 bp fragment, which had at one end a 

15 Spel site and close to the other end a BsmI site. This fragment 
was digested with BsmI and can be ligated to pALK754 after 
cutting the latter with Spel (at the site in the pBluescript 
polylinker) and BsmI and filling the Spel site with Klenow 
fragment. For further manipulations, the gene can be isolated 

20 by cutting the resulting plasmid with Spel and, for example, 
Clal. 



(&\ A truncated TSL1 gene 



25 A truncated version of TSL1 can be made by cutting pALK754 with 
StuI and joining the following linker (SEQ ID NO: 85) to this 
site: 

5 r -GGGCCCAACA ACACAATGGT TACCCCGAAA TCGAGGGCGG GCAACAGG -3' 
3" ' -CCCGGGTTGT TGTGTTACCA ATGGGGCTTT AGCTCCCGCC CGTTGTCC -5' 

30 

The linker recreates the StuI site and creates a new ATG start 
codon at +627 in frame with the coding sequence. Thus, this 
version of the gene encodes a truncated 123 )cDa long chain 
lacking the first 209 amino acids. It. was disclosed in Example 
35 10 that removal of the first 325 or so amino acids proceeds 
without loss of catalytic activity, but r leases trehalose 
synthase from strong inhibition by phosphate and the 
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requirement for F6P. Hence , this construction can encode a 
truncated 123 kDa polypeptide leading to a trehalose synthase 
with increased activity at physiological phosphate and F6P 
cone ntrations. A new Smal site is included in the linker. The * 
5 sequ nee flanking the new ATG on th 5 ' -side r sembles the 

original ATG f lanking sequence and the surrounding nucleotides \ 
are in accordance with the sequences known to occur most 
frequently at positions -7 to +4* 

10 f2) Disruption mutants . 

The TSS1 gene was disrupted to confirm that it is an essential 
gene in trehalose synthesis. The one-step gene disruption 
method [Rdthstein, R.J. (1983) Methods in Enzymology 101, 202- 
15 211)] was used as. follows: 

Plasmid pALK752 was- cut with Xcml. A blunted Sall-Xhol fragment 
containing the LEU2 gene from plasmid yEpl3 [Broach , J et al 
(1979) Gene 8 , .121-133] was ligated to the blunted Xcml site. 

20 The resulting plasmid was cut with Nsil and Pvul and the 
reaction mixture was run through a 0.8% low-melting point 
agarose gel. A band of 4 kb was excised from the gel and 
purified. S. cerevisiae strain S150-2B was transformed (using 
the one-step alkali-cation method of Chen et al [(1992) Current 

25 Genetics 21, 83-84]) with the 4 kb DNA fragment containing the 
TSS1 gene interrupted by the LEU2 gene. Leu* transformants were 
selected on minimal plates lacking leucine and containing 
glucose or galactose , and the clones obtained were then grown 
on YPD or YP/2% galactose, respectively. 

30 

As expected the phenotype of the disruptants resembled the fdpl 4 
and cifl phenotypes (see Example 2) . Only one transformant 
(ALKQ3569) was isolated on glucose and the several % 
transformants isolated on galactose were unable to grow on 
35 glucose. The glucose transformant and the tested galactose 
transformant (ALKO3570) did not accumulate trehalose in 
stationary phase (<0.2 % of dry wt.), lacked TPS and had low 
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TPP activity 10 % of vild type) • The 57 kDa band could not 
be seen on Western blots. Southern analysis (Fig 21) showed 
that the TSS1 g ne had been disrupt d by a LEU2 g ne, but the 
TSL1 gene was intact. 

5 

Another mutant, WDC-3A (see Table 1) with a disrupted TSS1 gene 
was obtained from the laboratory of Dr. C. Gancedo (Instituto 
de Investigaciones Biomedicas, CSIC, Madrid/ Spain) as a 
cifl::HIS3 disrupt ant. This mutant was easier to transform than 

10 were the tssl::LEU2 disruptants r and so it was used to confirm 
that the TSS1 gene on a plasmid can confer TPS and TPP 
activities, trehalose accumulation and improved stress 
resistance. WDC-3A was transformed with the plasmid pMB4 (see 
Table 1; the plasmid contains an intact CIF1 a TSS1 gene and a 

15 selectable URA3 marker) and transf ormants selected in the 

absence of uracil. Western analyses (not shown) indicated that 
the transf Ormants has acquired the 57 kDa band absent from WDO 
3A. The parent and a transf ormant were grown in parallel in 
minimal medium containing 2 % galactose and (transf ormant) no 

20- uracil or (parent) uracil. Duplicate cultures of each strain 
were harvested in early stationary phase after 28 h growth 
samples taken for studies of stress resistance', and the rest 
used for trehalose and enzyme assays (Table 11) . 

25 Table 11, Analysis of WDC-3A and its PMB14 transf ormant . 

Duplicate cultures were analyzed separately for trehalose and 
combined for enzyme assays. TPS activities were corrected for 
UDPGase activity in the absence of G6P and F6P. 

30 



Cell mass (g/100 ml medium) 
Trehalose (* of dry wt«) 
5 Whole horn aenates 

TPP (U/g fresh yeast) 
TPS (U/g fresh yeast) 
. 28 r 000 a supematants 
TPP (U/g fresh yeast) 
10 TPS (U/G fresh yeast) 
TPS (mU/mg protein) 



These results disclose that introduction of TSS1 on a plasmid 
15 can restore both TPS and TPP activities and increase the 

trehalose content of an organism. The TPP /TPS ratio (5 %) is 
much lower than that (about 35 %) of purified trehalose 
synthase whereas the baker' yeast used in Example 1 and the 
X2180 used in Example 2 both have TPP/TPS ratios in their 
20 homogenates close to that of pure enzyme* This suggests that 
transformation with TSS1 in pMB14 increases TPP only up to a 
limit set by the genetic background of the host (probably the 
amounts of 99 and 123 kDa polypeptides present) but causes a 
larger increase in TPS due to activity associated also with 57 
25 kDa chains not incorporated into the trehalose synthase 
complex. 

Samples of the transformant and host were frozen in water at 1 
jig yeast/ml and kept for 5 days at -20 °C. The viability was 
30 then tested on plates containing YP/2 % galactose. After 

freezing stress , 1.0 ± 0.1 % of the transformants and < 0,05 % 
of the host cells were viable. These results disclose that 
transformation of an organism with TSSl can increase its 
resistance to freezing-stress. 

35 
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WDC-3A 
ftssl::HIS3) 

2.6 
0.84, 0.81 

0.02 

0.84 ± 0.37 
0.01 

0.22 ± 0.20 
4.0 



p»B14 fTSSl) 
yransgogmant 
2.8 

2.9, 3.0 * 

0.84 * 
17.9 ± 3.5 

0.67 
14,6 ± 3.3 
223 
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(3) Strategies for transformation. 

Laboratory strains of S. cer visiae bearing auxotrophic markers 
such as his3, 1 u2, lys 2, trpl and ura3 can be easily 
5 transformed with the trehalose synthase genes by essentially 
the same methjods described for transformation of tssl 
disruptants with TSSl. Versions of the genes in which the 
natural promoters and terminators are intact or have been 
replaced by (stronger and regula table) promoters and 

10 terminators from other yeast genes can be used. For example, 
PGKI [pMA91; Mellor et al (1983) Gene 24, 1-14] , ADC1 [pAAH5; 
Ammerer (1983) Methods in Enzymology 101, 192-201] and MEL1 
[pALK3537, pALK41, etc, Suominen, P.L. (1988) Doctoral 
dissertation, University of Helsinki] systems have been used to 

15 increase the expression levels of genes in S. cerevisiae and 
other yeast. The MEL1 system has the advantage that the 
expression can be regulated, being repressed by glucose and' 
induced by galactose. Standard vectors are available [episomal 
and integrating and centromere yeast plasmids are reviewed by 

20 Rose & Broach (1990) Methods in Enzymology 185, 234-279 and 

Stearns, T., Ma, H & Botstein, D. (1990) Methods in Enzymology 
185, 280-291] that incorporate auxotrophic markers such as / 
HIS3, LEU2, TRP1 and URA3 , which can be used to select the 
transf ormants . Vectors based on these principles, but suited 

25 to a particular task can be constructed by a person familiar 
with the art. 

The basic strategy is to leave the yeast with an intact version 
of its natural genes for trehalose synthase and introduce, 

30 either on episomes or integrated into a yeast chromosome, extra 
copies of the genes. These may be under control of their own 
promoters, or of stronger promoters and promoters that can be 
regulated, for example by adding substances such as galactose 
to the growth medium or by changing the temperature. The use of 

35 such promoters has been described [see, e.g., Mylin et al. 

(X990) Methods in Enzymology 185, 297-308; Sledziewski et al. 
(1990) Methods in Enzymology 185, 351-366]. This strategy 
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avoids problems that can be foreseen if all copies of the genes 
genes are put under tight control (such as the defects in sugar 
catabolism exp cted if TSS1 is not properly expressed; see 
Example 7.) Transformed yeast bearing additional copies of the i 
5 genes with their natural promoters may accumulate enough 

trehalose to exhibit the desired improvement in stability . They • 
may also cycle enough glucose units through trehalose during 
fermentative conditions to generate an ATPase that accelerates 
fermentation and increases the yield of ethanol on glucose* 

10 Alternatively , the promoters of one or more genes can be 

changed to promoters that are more active under fermentative 
conditions* In another aspect of the invention , copies of the 
ORFs of the genes can be inserted into expression vectors 
eguipped with powerful promoters (that may be regulatable) to 

15 cause still larger increases in trehalose. This can be 
particularly useful for the production of trehalose. 

Transforming yeast with two or all three genes can be achieved 
in several ways. The most obvious procedure is to use different 

20 auxotrophic markers and introduce the genes sequentially. 

Another method is to construct a Yip containing DRA3 and a 
modified version of, say, TSLl with a stronger promoter but 
still containing a region of homology upstream of this 
promoter. After directed integration of this plasmid to the 

25 chromosomal ura3 site and selection of URA+ transf ormants, 
mutants in which the URA3 has again been excised (with a 
frequency of about 1 x 10" 4 ) can be selected by growth on media 
containing 5-fluoroorotic acid [see Stearns et al. (1990) loc. 
cit. ] . Some of the selected cells would contain a new version 

30 of the gene, with the stronger promoter and can again be 

transformed, this time with, say, a modified TSS1 gene. The 
resultant transf ormants will contain one copy of TSLl driven by 
the new promoter, and two copies of TSS1, one of which is still * 
under the control of its natural prompter. Thirdly, a Yip 

35 containing two or all three genes can be used to introduce the 
genes in a single step. 
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Various methods to transform industrial, polyploid yeast, which 
lack auxotrophic markers have been described in the literatur • 
Earlier methods have been reviewed by Knowles, J.K.C. & Tubb, 
R.S. [(1987) E.B.C. symposium on brewer's yeast, Helsinki, 
1986. Monograph XII 169-185] and include the use of marker 
genes that confer resistance to antibiotics, methylglyoxal , 
copper, cinnamic acid and other compounds. These markers 
facilitate selection of transf ormants. Some of the marker genes 
are themselves of yeast origin, and so are preferred for 
acceptability reasons. When suitable modifications and 
combinations of the genes have been identified by using 
laboratory yeast, they may be transferred to industrial yeast 
using these procedures or others described in the literature, 
such as co-transformation with pALK2 and pALK7 [Suominen, P. I. 

(1988) loc. cit.].. These plasmids contain a readily selectable 
MEL1 marker gene on a 2 /i-based plasmid that can readily be 
cured, thus facilitating seguential transformation with more 
than one gene if it is not practicable to introduce the 
modified genes in one step using this co-transformation 
procedure, 

Example 15. Transformation of crop plants . 

Methods for the transformation of higher plants, including crop 
plants of economic importance, have been described [Goodman at 
al (1987) Science 236, 48-54; Weising et al, (1988) Annual 
Review of Genetics 22, 421-477; Glasser & Fraley (1989) Science 
244, 1293-1299; Lindsey, K. (1992) Journal of Biotechnology 26, 
1-28] and laboratory manuals setting, out standard procedures 
are available such as the Plant Molecular Biology Manual [ed. 
Gelvin & Schnilperoort (1988) Kluwer Academic Press}* Of 
particular utility is the use of tissue specific promoters from 
•the genes of proteins that are expressed in a highly 
tissue-specific manner [see, e.g., Higgins (1984) Annual Review 
of Plant Physiology 35, 191 et sea . : Shotwell and Larkins 

(1989) in The Biochemistry of Plants 15, 297 et seg. 1 . The use 
of such promoters will allow the expression of trehalose 
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synthase in (a) sp cifically the frost- and drought-sensitive 
tissues of plants so that they may be protected from these and 
equivalent stresses without diverting carbohydrate metabolism 
in the major storage tissues, or alternatively (b) precisely in 
th edible tissues. Th purpose of this s cond alternative is 
to cause the accumulation of trehalose in products such as the 
fruit of tomatoes, in order to increase the shelf -life of these 
products. The expression of non-plant genes, with higher A+T 
contents than are commonly found in plant genes can generally 
be improved by changing the codons to increase the G+C content, 
and in particular to avoid regions of overall high A+T content 
Perlak et al. (1991) loc. cit.}. It is foreseen that such 
modifications will be beneficial in the case of the genes TSS1 
and TSL1, which have A+T-rich regions. Selection systems are 
available for use in the transformation of higher plants, 
including plasmids comprising the gene (hpt) for hygromycin 
phosphotransferase [Dale & Ow (1991) Proceedings of the 
National Academy of Sciences, USA 88, 1055810562]. These and 
similar methods familiar to persons skilled in the art* can be 
used, first to introduce various modifications of the yeast 
trehalose synthase genes into Arabidoosis thaliana . and then to 
transf er the most successful modifications to plants of 
economic importance. 

One example of how one. would transform a crop plant (dicots and 
some monocots) is via a Ti plasmid. A large fragment of the Ti 
plasmid encompassing both the T-DNA and vir regions is first 
cloned into the common bacterial plasmid pBR322. One or more of 
the trehalose synthase genes are then cloned into a non- 
essential region of the T-DNA and introduced into Aorobacterium 
tumefaciens carrying an intact Ti plasmid. The plants are then 
infected with these bacteria and the gene products of the vir 
region on the intact Ti plasmid mobilize the recombinant T-t)NA, 
and the recombinant T-DNA integrates into the plant geome. One 
or more of the trehalose synthase genes can be introduced into 
the plant in this manner, by inserting the genes into either 
the same plasmid or separate plasmids. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION 

(1) APPLICANT: 

(A) NAME: ALKO LTD. (et al.) 

(B) STREET: Salmisaarenranta 7 

(C) CITY: Helsinki 

(D) COUNTRY: FINLAND 

(E) POSTAL CODE: SF-00100 

(ii) TITLE OF INVENTION: Increasing the trehalose content of 

organisms by transforming them with 
combinations of the structural genes 
for trehalose synthase. 

(iii) NUMBER OF SEQUENCES: 85 

(iv) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US / 07/836,021 

(B) FILING DATE: 14 February 1992 

(A) APPLICATION NUMBER: US / 07/841,997 

(B) FILING DATE: 28 February 1992 



(2) INFORMATION FOR SEQ ID NO si: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2481 base pairs 

(B) TYPE: Nucleotide 

(C) STRANDEDNESS: Doublestranded 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Genomic DNA 

(iii) HYPOTHETICAL: no 

(iv) ANTI-SENSE: no 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(B) STRAIN: S288C 

(E) HAPLOTYPE: Haploid 

(vii) IMMEDIATE SOURCE 

(A) LIBRARY: Genomic 

(B) CLONE: 20 

(viii) POSITION IN GENOME: 
(A) CHROMOSOME: 2R 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l 



TTTTTAAACG TATATAGATG TCTACATGTG TGTTTTTGTT TTTTTACGTA 50 
5 CGTATACCCA CCTATATATG CATAATCCGT AATTGAAAAA AAAAAAAGTA 100 
AAGATCAAGG AACACATCAC CCTGGGCACA TCAAGCGTGA GGAATGCCGT 150 
CCAACTGGTG GAGACGCTTG ATTTGCTCTT TTTGTTCCTG GGTCCAACCC 200 
GGTCTCGAAG AACATCAGCA CCACGCCCGC AACGACAAAG AACATTGCAA 250 
TACACTTGCA TATGTGAGCA TAGTCGAGCG GTCCGTTCTG TGGTTGATGC 300 

10 TGTTGTTCTT TCTTCTGTTT GTCAGGGGTG ATAGCCATAT CTTCGTGCTC 350 
TTGTTGCGAT TGTTCTGTTC CATCTGCACC AGAACAAAGA ACAAAAGAAC 400 
AAGGAACAAA GTCCAAGCAC GTCAGCGCTG TTTATAAGGG GATTGACGAG 450 
GGATCGGGCC TAGAGTGCCA GCGCGCCAGG GAGAGGGAGC CCCCTGGGCC 500 
CTCATCCGCA GGCTGATAGG GGTCACCCCG CTGGGCAGGT CAGGGCAGGG 550 

15 GCTCTCAGGG GGGCGCCATG GACAAACTGC ACTGAGGTTC TAAGACACAT 600 
GTATTATTGT GAGTATGTAT ATATAGAGAG AGATTAAGGC GTACAGCCGG 650 
GTGGTAGAGA TTGATTAACT TGGTAGTCTT ATCTTGTCAA TTGAGTTTCT 700 
GTCAGTTTCT TCTTGAACAA GCACGCAGCT AAGTAAGCAA CAAAGCAGGC 750 
TAACAAACTA GGTACTCACA TACAGACTTA TTAAGACATA GAACTATGAC 800 

20 TACGGATAAC GCTAAGGCGC AACTGACCTC GTCTTCAGGG GGTAACATTA 850 
TIGTGGTGTC CAACAGGCTT CCCGTGACAA TCACTAAAAA CAGCAGTACG 900 
GGACAGTACG AGTACGCAAT GTCGTCCGGA GGGCTGGTCA CGGCGTTGGA 950 
AGGGTTGAAG AAGACGTACA CTTTCAAGTG GTTCGGATGG CCTGGGCTAG 1000 
AGATTCCTGA CGATGAGAAG GATCAGGTGA GGAAGGACTT GCTGGAAAAG 1050 

25. TTTAATGCCG TACCCATCTT CCTGAGCGAT GAAATCGCAG ACTTACACTA 1100 * 
CAACGGGTTC AGTAATTCTA TTCTATGGCC GTTATTCCAT TACCATCCTG 1150 
GTGAGATCAA TTTCGACGAG AATGCGTGGT TCGGATACAA CGAGGCAAAC 1200 
CAGACGTTCA CCAACGAGAT TGCTAAGACT ATGAACCATA ACGATTTAAT 1250 
CTGGGTGCAT GATTACCATT TGATGTTGGT TCCGGAAATG TTGAGAGTCA 1300 

30 AGATTCACGA GAAGCAACTG CAAAACGTTA AGGTCGGGTG GTTCCTGCAC 1350 
ACACCATTCC CTTCGAGTGA AATTTACAGA ATCTTACCTG TCAGACAAGA 1400 
GATTTTGAAG GGTGTTTTGA GTTGTGATTT AGTCGGGTTC CACACATACG 1450 . 
ATTATGCAAG ACATTTCTTG TCTTCCGTGC AAAGAGTGCT TAACGTGAAC 1500 
ACATTGCCTA ATGGGGTGGA ATACCAGGGC AGATTCGTTA ACGTAGGGGC 1550 

35 CTTCCCTATC GGTATCGACG TGGACAAGTT CACCGATGGG TTGAAAAAGG 1600 
AATCCGTACA AAAGAGAATC CAACAATTGA AGGAAACTTT CAAGGGCTGC 1650 
AAGATCATAG TTGGTGTCGA CAGGCTGGAT TACATCAAAG GTGTGCCTCA 1700 
GAAGTTGCAC GCCATGGAAG TGTTTCTGAA CGAGCATCCA GAATGGAGGG 1750 
GCAAGGTTGT TCTGGTACAG GTTGCAGTGC CAAGTCGTGG AGATGTGGAA 1800 

40 GAGTACCAAT ATTTAAGATC TGTGGTCAAT GAGTTGGTCG GTAGAATCAA 1850 
CGGTCAGTTC GGTACTGTGG AATTCGTCCC CATCCATTTC ATGCACAAGT 1900 
. CTATACCATT TGAAGAGCTG ATTTCGTTAT ATGCTGTGAG CGATGTTTGT 1950 
TTGGTCTCGT CCACCCGTGA TGGTATGAAC TTGGTTTCCT ACGAATATAT 2000 
TGCTTGCCAA GAAGAAAAGA AAGGTTCCTT AATCCTGAGT GAGTTCACAG 2050 

45 GTGCCGCACA ATCCTTGAAT GGTGCTATTA TTGTAAATCC TTGGAACACC 2100 
GATGATCTTT CTGATGCCAT CAACGAGGCC TTGACTTTGC CCGATGTAAA 2150 
GAAAGAAGTT AACTGGGAAA AACTTTACAA ATACATCTCT AAATACACTT 2200 
CTGCCTTCTG GGGTGAAAAT TTCGTCCATG AATTATACAG TACATCATCA 2250 
AGCTCAACAA GCTCCTCTGC CACCAAAAAC TGATGAACCC GATGCAAATG 2300 

50 AGACGATCGT CTATTCCTGG TCCGGTTTTC TCTGCCCTCT CTTCTATTCA 2350 
CTTTTTTTAT ACTTTATATA AAATTATATA AATGACATAA CTGAAACGCC 2400 
ACACGTCCTC TCCTATTCGT TAACGCCTGT CTGTAGCGCT GTTACTGAAG 2450 
CTGCGCAAGT AGTTTTTTCA CCGTATAGGC C 2481 



55 
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(3) INFORMATION FOR SEQ ID NO: 2 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 495 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: Polypeptide 
10 (iii) HYPOTHETICAL: Yes 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 
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(4) INFORMATION FOR SEQ ID NO: 3 

(i) SEQUENCE CHARACTERISTICS 
30 (A) LENGTH: 3000 base pairs 

(B) TYPE: Nucleotide 

(C) STRANDEDNESS : Doublestranded 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULAR TYPE: Genomic DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

40 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(B) STRAIN: S288C 

(E) HAPLOTYPE: Haploid 

45 

(Vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Genomic 

(B) CLONE: 6 

50 (vii) POSITION IN GENOME 
(A) CHROMOSOME: 13 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 
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CCTCCTCTGG ATCTTCTGGG TCTTCTGCGC CACCTTCCAT TAAAAGGATT 50 

ACGCCCCACT TGACTGCGTC TGCTGCAAAA CAGCGTCCCT TATTGGCTAA 100 

ACAGCCTTCT AATCTGAAAT ATTCGGAGTT AGCAGATATT TCGTCGAGTG 150 

AGACGTCTTC GCAGCATAAT GAGTCGGACC CGGATGATCT AACTACTGCC 200 

5 CTGACGAGGA TATGTTTCTG ATTAGGAATT GATGACGCGA GAGGACTACA 250 

* AGGTTCAAAG TTCGGCGCTA TTCATAAATC AACTAAGAAA TATGCGCTGT 300 
TAAGGTCATC TCAGGAGCTG TTTAGCCGTC TTCCATGGTC GATCGTTCCC 350 
TCTATCAAAG GTAATGGCGC CATGAAGAAC GCCATAAACA CTGCAGTCTT 400 

* GGAGAATATC ATTCCGCACC GTCATGTTAA GTGGGTCGGT ACCGTCGGAA 450 
10 TCCCAACGGA TGAGATTCCG GAAAATATCC TTGCGAACAT CTCTGACTCT 500 

TTAAAAGACA AGTACGACTC CTATCCTGTC CTTACGGACG ACGACACCTT 550 

CAAAGCCGCA TACAAAAACT ACTGTAAACA AATCTTGTGG CCTACGCTGC 600 

ATTACCAGAT TCCAGACAAT CCGAACTCGA AGGCTTTTGA AGATCACTCT 650 

TGGAAGTTCT ATAGAAACTT AAACCAAAGG TTTGCGGACG CGATCGTTAA 700 

15 AATCTATAAG AAAGGTGACA CCATCTGGAT TCATGATTAC CATTTAATGC 750 

TGGTTCCGCA GATGGTGAGA GACGTCTTGC CTTTTGCCAA AATAGGATTT 800 

ACCTTACATG TCTCGTTCCC CAGTAGTGAA GTGTTTAGGT GTCTGGCTCA 850 

GCGTGAGAAG ATCTTAGAAG GCTTGACCGG TGCAGACTTT GTCGGCTTCC 900 

AGACGAGGGA GTATGCAAGA CATTTCTTAC AGACGTCTAA CCGTCTGCTA 950 

20 ATGGCGGACG TGGTACATGA TGAAGAGCTA AAGTATAACG GCAGAGTCGT 1000 

TTCTGTGAGG TTCACCCCAG TTGGTATCGA CGCCTTTGAT TTGCAATCGC 1050 

AATTGAAGGA TGGAAGTGTC ATGCAATGGC GTCAATTGAT TCGTGAAAGA 1100 

TGGCAAGGGA AAAAACTAAT TGTGTGTCGT GATCAATTCG ATAGAATTAG 1150 

AGGTATTCAC AAGAAATTGT TGGCTTATGA AAAATTCTTG GTCGAAAATC 1200 

25 CGGAATACGT GGAAAAATCG ACTTTAATTC AAATCTGTAT TGGAAGCAGT 1250 

AAGGATGTAG AACTGGAGCG CCAGATCATG ATTGTCGTGG ATAGAATCAA 1300 

CTCGCTATCC ACCAATATTA GTATTTCTCA ACCTGTGGTG TTTTTGCATC 1350 

AAGATCTAGA TTTTTCTCAG TATTTAGCTT TGAGTTCAGA GGCAGATTTG 1400 

TTCGTAGTCA GCTCTCTAAG GGAAGGTATG AACTTGACAT GTCACGAATT 1450 

30 TATCGTTTGT TCTGAGGACA AAAATGCTCC CCTACTGTTG TCAGAATTTA 1500 

CTGGTAGTGC ATCTTTATTG AATGATGGCG CTATAATAAT TAACCCATGG 1550 

GATACCAAGA ACTTCTCACA AGCCATTCTC AAGGGGTTGG AGATGCCATT 1600 

CGATAAGAGA AGGCCACAGT GGAAGAAATT GATGAAAGAC ATTATCAACA 1650 

ACGACTCTAC AAACTGGATC AAGACTTCTT TACAAGATAT TCATATTTCG 1700 

35 TGGCAATTCA ATCAAGAAGG TTCCAAGATC TTCAAATTGA ATACAAAAAC 17 6 0 

ACTGATGGAA GATTACCAGT CATCTAAAAA GCGTATGTTT GTTTTCAACA 1800 

TTGCTGAACC ACCTTCATCG AGAATGATTT CCATACTGAA TGACATGACT 1850 

TCTAAGGGCA ATATCGTTTA CATCATGAAC TCATTTCCAA AGCCCATTCT 1900 

GGAAAATCTT TACAGTCGTG TGCAAAACAT TGGGTTGATT GCCGAGAATG 1950 

40 GTGCATACGT TAGTCTGAAC GGTGTATGGT ACAACATTGT TGATCAAGTC 2000 

GATTGGCGTA ACGATGTAGC CAAAATTCTC GAGGACAAAG TGGAGAGATT 2050 

ACCTGGCTCG TACTACAAGA TAAATGAGTC CATGATCAAG TTCCACACTG 2100 

AAAATGCGGA AGATCAAGAT CGTGTAGCTA GTGTTATCGG TGATGCCATC 2150 

ACACATATCA ATACTGTTTT TGACCACAGA GGTATTCATG CCTACGTTTA 2200 

* 45 CAAAAACGTT GTTTCCGTAC AACAAGTGGG ACTTTCCTTA TCGGCAGCTC 2250 

AATTTCTTTT CAGATTCTAT AATTCTGCTT CGGATCCACT GGATACGAGT 2300 

TCCGGCCAAA TCACAAATAT TCAGACACCA TCTCAACAAA ATCCTTCAGA 2350 

4 TCAAGAACAA CAACCTCCAG CCTCTCCCAC TGTGTCGATG AACCATATTG 2400 

ATTTCGCATG TGTCTCTGGT TCATCGTCTC CTGTGCTTGA ACCATTGTTC 2450 

50 AAATTGGTCA ATGATGAAGC AAGTGAAGGG CAAGTAAAAG CCGGACACGC 2500 

CATTGTTTAT GGTGATGCTA CTTCTACTTA TGCCAAAGAA CATGTAAATG 2550 

GGTTAAACGA ACTTTTCACG ATCATTTCAA GAATCATTGA AGATTAAATT 2600 

TTACCATTTT AAAATTTTAA TGTTCTTGGG TATGAACTTT TATTTTCAAC 2650 

TGCTTATTAT ATATCAATTC TATAAATTTT TTTCTTCTCT CTAACGACCA 2700 

55 ATTATAAAAT TCATCCTCTT ATTTATTACA GCATCTTATA CATTATGTAT 2750 

ATGGGTAGCT ATTATTCATT TTTGCTTCGT AAGGACTTTT TTTGTCAACT 2800 
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TTTTCATCCT AA6CGGCTAA AA6T6ATT6G AGAGGAATGT CCAGGCGACC 2850 
AATGATAAAA ACGCTTTCTC TTGGAACAAG AAATAGGAGC AATTGACAGT 2900 
TGTCGATGAA CAGCGAAAAT AGTAAGATAA CCTTCAAGCC CAATATTCTA 2950 
ATTAAAGGCG TTTATATATT TGTACTTTAT GGTATGTGCA TATGTATTGT 3000 

5 

t 

(5) INFORMATION FOR SEQ ID NO: 4 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 785 amino acids 

(B) TYPE: Amino acid 
CD) TOPOLOGY: Linear 

15 (ii) MOLECULAR TYPE: Polypeptide 

(iii) HYPOTHETICAL: Yes 

(v) FRAGMENT TYPE: C-terminal 

"20 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 





Arg 


Gly 


Leu 


Gin 


Gly 


Ser 


Lys 


Phe 


Gly 


Ala 


He His Lys Ser Thr 


25 










5 










10 . 


15 




Lys 


Lys 


Tyr 


Ala 


Leu 
20 


Leu 


Arg 


Ser 


Ser 


Gin 
25 


Glu Leu Phe Ser Arg 

30 


• 


Leu 


Pro 


Trp 


Ser 


He 
35 


Val 


Pro 


Ser 


He 


Lys 
40 


Gly Asn Gly Ala Met 

45 


30 


Lys 


Asn 


Ala 


He 


Asn 
50 


Thr 


Ala 


Val 


Leu 


Glu 
55 


Asn He He Pro His 

60 




Arg 


His 


Val 


Lys 


Trp 
65 


Val 


Gly 


Thr 


Val 


Gly 
70 


He Pro Thr Asp Glu 

75 




lie 


Pro 


Glu 


Asn 


He 


Leu 


Ala 


Asn 


He 


Ser 


Asp Ser Leu Lys Asp 


35 










80 










85 


90 




Lys 


Tyr 


Asp 


Ser 


Tyr 
95 


Pro 


Val 


Leu 


Thr 


Asp 
100 


Asp Asp Thr Phe Lys 

105 




Ala 


Ala 


Tyr 


Lys 


Asn 


Tyr 


Cys 


Lys 


Gin 


He 


Leu Trp Pro Thr Leu 




His 








110 










115 


120 


40 


Tyr 


Gin 


He 


Pro 


Asp 


Asn 


Pro 


Asn 


Ser 


Lys Ala Phe Glu Asp 




His 








125 










130 


135 




Ser 


Trp 


Lys 


Phe 
140 


Tyr 


Arg 


Asn 


Leu 


Asn 
145 


Gin Arg Phe Ala Asp 

150 




Ala 


He 


Val 


Lys 


He 


Tyr 


Lys 


Lys 


Gly 


Asp 


Thr He Trp He His 


45 










155 










160 


165 




Asp 


Tyr 


His 


Leu 


Met 
170 


Leu 


Val 


Pro 


Gin 


Met 
175 


Val Arg Asp Val Leu 

180 




Pro 


Phe 


Ala 


Lys 


He 
185 


Gly 


Phe 


Thr 


Leu 


His 
190 


Val Ser Phe Pro Ser 

195 


50 


Ser 


Glu 


Val 


Phe 


Arg 
200 


Cys 


Leu 


Ala 


Gin 


Arg 
205 


Glu Lys He Leu Glu 

210 




Gly 


Leu 


Thr 


Gly 


Ala 

215 


Asp 


Phe 


Val 


.Gly 


Phe 
220 


Gin Thr Arg Glu Tyr 

225 




Ala 


Arg 


His 


Phe 


Leu 


Gin 


Thr 


Ser 


Asn 


Arg 


Leu Leu Met Ala Asp 


55 










230 










235 


240 




Val 


Val 


His 


Asp 


Glu 


Glu 


Leu 


Lys 


Tyr 


Asn 


Gly Arg Val Val Ser 
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245 








Val 

V O.X 




Phe 


Thr Pro 


Val 


Glv 










260 








Gin 


Leu 


Lvs 


Asd Glv 


Ser 


Val 


5 








275 








Glu 


Arcf 


XX£/ 


Gin Gly 


Lvs 


Lvs 










290 








Asp 




Tie 

lie 


Atct Glv 

AX y wX Jr 


lie 

XX w 


His 

11X0 










TOR 






10 


File 


Leu 


Val 




riu 












320 








V7lil 


Tip 

lie 




Tie Glv 

lie wAjr 


Cpr 

0CX 












335 








Tie 


Met 


Tie 

1 1 w 


Val Val 

vux vox 


ASTJ 

A*0£/ 




15 








350 








Cot 
0 cx 


Tie 

IXC 


Seir 

0eX 


Gin Pro 


Val 


Val 










365 








weX 


Girt 




t.ou Ala 


Leu 


Ser 










380 






20 


wex 


Cp"r 

0eX 


Leu 


At*ct Glu 


Glv 


Met 










395 
*j ^ *j 








veil 


v#ys 


Cor 
ocx 


fllii A en 

U1U AO£J 


T.VR 
Xijr o 


Asn 










410 










Glv 

VJX jf 


Ser* 

OCX 


Ala Sei* 


Leu 


Leu 


25. 






'425 








* X w 


TT*n 

Air P 


AO 


Tht* Lvs 


Asn 

A014 


Phe 

xr xxe 










440 








Glu 

\7X U 


Met 




Phe Astj 

flic no^ 


Lys 












455 








Lvs 

XI jr a 


AO 


Tie 
lie 


Tie Asn 

xxe nou 


Asn 


Asn 










470- 








lieU 




Asp 


T 1 o Hie 
lies Jxlo 


Tl o 
116 


Cor 
Dei 
















Xjyo 


T1 o 
116 


.file 


T.Y7C T.Oll 

Jjjro ijcU 


AO il 


Tnr 

JL ilx 


35 






500 








Sot* 

k?ei 


Sot* 

w eX 


XI jr O 


Lvs Ay* a 

XI jr 9 AX 


Met 


Phe 

xr xxe 










515 








Set" 


Car 

mCJ. 


Allj 


Met lie 

1 4 w l« X X<e 


Sei* 

*jex. 


lie 

xxe 








530 






40 


Asn 


Tie 

XXe 


Val 

V CX X 


Tvtt Tie 

ijf x xxe 


Met 

lie w 


Asn 










545 








Asn 


Lou 




Cor Ar/a 


Val 


Gin 

XI* 










560 








f3l v 
u x y 


Al A 
Ala 


x y x 


Va 1 Ser* 

V CIX wel 


T.on 


A fin 

ABU 


45 








575 








OX 11 


Val 

V CIX 




11^ AX %j 


Aen 
nan 


Asn 

AO by 










590 








Val 

V CIX 


Glu 

W X U 


ax y 


Leu Pt*o 

UCU 1TXW 


Glv 
wiy 


Cor 
ex. 










605 






50 ' 


Tie 

X X w 


LVS 


Phe 

It lie 


His Thr 
mo xxix 


Glu 

wx u 


Asn 










620 








oex 


Val 

V CIX 


Tie 

1 1 w 


Glv Aso 


Ala 

nxu 


Tie 

lie 










635 








His 


Arg 


Gly 


lie His 


Ala 


Tyr 


55 








650 








Gin 


Gin 


Val 


Gly Leu 


Ser 


Leu 



250 255 



He 


Asp 


Ala 
265 


Ph 


Asp 


Leu 


Gin 


Ser 
270 


Met 


Gin 


Trp 
280 


Arg 


Gin 


Leu 


He 


Arg 
285 


Leu 


He 


Val 
295 


Cys 


Arg 


Asp 


Gin 


Phe 
300 


Lys 


Lys 


Leu 
310 


Leu 


Ala 


Tyr 


Glu 


Lys 

315 


Tyr 


Val 


Glu 


Lys 


Ser 


Thr 


Leu 


He 




325 










330 


Lys 


Asp 


Val 
340 


Glu 


Leu 


Glu 


Arg 


Gin 
345 


He 


Asn 


Ser 
355 


Leu 


Ser 


Thr 


Asn 


He 
360 


Phe 


Leu 


His 
370 


Gin 


Asp 


Leu 


Asp 


Phe 
375 


Ser 


Glu 


Ala 
385 


Asp 


Leu 


Phe 


Val 


Val 
390 


Asn 


Leu 


Thr 
400 


Cys 


His 


Glu 


Phe 


He 
4Q5 


Ala 


Pro 


Leu 
415 


Leu 


Leu 


Ser 


Glu 


Phe 
420 


Asn 


Asp 


Gly 
430 


Ala 


He 


He 


He 


Asn 
435 


Ser 


Gin 


Ala 

445 


He 


Leu 


Lys 


Gly 


Leu 
450 


Arg 


Pro 


Gin 
460. 


Trp 


Lys 


Lys 


Leu 


Met 
465 


Ser 


Thr 


Asn 


Trp 


He 


Lys 


Thr 


Ser 






475 






480 


Trp 


Gin 


Phe 
490 


Asn 


Gin 


Glu 


Gly 


Ser 
495 


Lys 


Thr 


Leu 
505 


Met 


Glu 


Asp 


Tyr 


Gin 
510 


Val 


Phe 


Asn 

520 


He 


Ala 


GlU 


Pro 


Pro 
525 


Leu 


Asn 


Asp 
535 


Met 


Thr 


Ser 


Lys 


Gly 
540 


Ser 


Phe 


Pro 
550 


Lys 


Pro 


He 


Leu 


Glu 
555 


Asn 


He 


Gly 
565 


Leu 


He 


Ala 


Glu 


ASn 
570 


Gly 


Val 


Trp 
580 


Tyr 


Asn 


He 


Val 


Asp 
585 


Val 


Ala 


Lys 
595 


He 


Leu 


Glu 


Asp 


Lys 
600 


Tyr 


Tyr 


Lys 

610 


He 


Asn 


Glu 


Ser 


Met 
615 


Ala 


Glu 


Asp 
625 


Gin 


Asp 


Arg 


Val 


Ala 
630 


Thr 


His 


He 
640 


Asn 


Thr 


Val 


Phe 


Asp 
645 


Val 


Tyr 


Lys 
655 


Asn 


Val 


val 


Ser 


Val 
660 


Ser 


Ala 


Ala 


Gin 


Phe 


Leu 


Phe 


Arg 
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665 670 675 

Phe Tyr Asn Ser Ala Ser Asp Pro Leu Asp Thr S r Ser Gly Gin 

680 685 690 

lie Thr Asn lie Gin Thr Pro Ser Gin Gin Asn Pro s r Asp Gin 

695 700 705 

' Glu Gin Gin Pro Pro Ala Ser Pro Thr Val Ser Met Asn His II 

710 715 720 

Asp Phe Ala Cys Val Ser Gly Ser Ser Ser Pro Val Leu Glu Pro 

725 730 735 

Leu Phe Lys Leu Val Asn Asp Glu Ala Ser Glu Gly Gin Val Lys 

740 745 750 

Ala Gly His Ala He Val Tyr Gly Asp Ala Thr Ser Thr Tyr Ala 

755 760 765 

Lys Glu His Val Asn Gly Leu Asn Glu Leu Phe Thr He He Ser 

770 775 780 

Arg He Tie Glu Asp 

785 



(6) INFORMATION FOR SEQ ID NO: 5 

(i) SEQUENCE CHARACTERISTICS i 
(A) LENGTH: 4 amino acids 
CB) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(iij MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 5 
Tyr He Ser Lys 



(7) INFORMATION FOR SEQ ID NO: 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 6 

Asp Val Glu Glu Tyr Gin Tyr Leu Arg 

5 
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(8) INFORMATION FOR SEQ ID NO: 7 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 7 

His Phe Leu Ser Ser Val Gin Arg 

5- 



(9) INFORMATION FOR SEQ ID NO: 8 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) . SEQUENCE DESCRIPTION: SEQ ID NO: 8 

Val Leu Asn Val Asn Thr Leu Pro Asn Gly Val Glu Tyr Gin 

5 10 



(10) INFORMATION FOR SEQ ID NO: 9 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 9 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(11) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 9 

Ser Val Val Asn Glu Leu Val Gly Arg 

5 



(11) INFORMATION FOR SEQ ID NO: 10 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
Glu Thr Phe Lys " 



(12) INFORMATION FOR SEQ ID NO: 11 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: Amino acid 
(D)* TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL : No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 11 

Leu Asp Tyr lie Lys 

5 



(13) INFORMATION FOR SEQ ID NO: 12 

(i) SEQUlllCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 12 

lie Leu Pro Val Arg 

5 
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(14) INFORMATION FOR SEQ ID NO: 13 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY : Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

Glu Val Asn Xaa Glu Lys 

5 



(15) INFORMATION FOR SEQ ID NO: 14 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 14 

Phe Tyr Asp Xaa Xaa 

5 



(16) INFORMATION FOR SEQ ID NO: 15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 15 

Leu Xaa Ala Met Glu Val Phe Leu Asn Glu Xaa Pro Glu 

5 10 
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(17) INFORMATION FOR SEQ ID NO: 16 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 16 

Tyr Thr Ser Ala Phe Trp Gly Glu Asn Phe Val Xaa Glu Leu 

5 10 



(18) INFORMATION FOR SEQ ID NO: 17 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 9 amino acids 

(B) TYPE: Amino acid 
(D> TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 17 

Phe Gly Xaa Pro Gly Leu Glu lie Pro 

5 



(19) INFORMATION FOR SEQ ID NO: 18 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 6 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 18 

Xaa Gly Ser Val Met Gin 

5 
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(20) INFORMATION FOR SEQ ID NO: 19 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 7 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 19 

Leu Pro Gly Ser Tyr Tyr Lys 

5 



(21) INFORMATION FOR SEQ. ID NO: 20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 12 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) Molecular type: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal . 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 20 

Asp Ala He Val Val Asn Pro Met Asp Ser Val Ala 

5 10 



(22) INFORMATION FOR SEQ ID NO: 21 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 5 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 21 

Met He Ser He Leu 

5 
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(23) INFORMATION FOR SEQ ID NO: 22 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH 6 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 22 

Arg Arg Pro Gin Trp Lys 

5 



(24) INFORMATION FOR SEQ ID NO: 23 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 5 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 23 

Ser Xaa Pro Gin Lys 

5 



(25) INFORMATION FOR SEQ ID NO: 24 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 15 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal. 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 24 

Phe Tyr Arg Asn Leu Asn Gin Arg Phe Ala Asp Ala lie Val Lys 

5 10 15 
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(26) INFORMATION FOR SEQ ID NO: 25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 11 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Lin ar 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 25 

Asp Gly Ser Val Met Gin Xaa Xaa Gin Leu Xaa 

5" 10 



(27) INFORMATION FOR SEQ ID NO: 26 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 18 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 26 

Asn Ala lie Asn Thr Ala Val Leu Glu Asn lie lie Pro Xaa Xaa 1 

5 10 15 

Xaa Val Lys 



(28) INFORMATION FOR SEQ ID NO: 27 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 12 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 27 

Leu Val Asn Asp Glu Ala Ser Glu Gly Gin Val Lys 

5- 10 
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(29) INFORMATION FOR SEQ ID NO: 28 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 11 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 28 

Xaa Gin Asp lie Leu Leu Asn Asn Thr Phe Xaa 

5 10 



(30) INFORMATION FOR SEQ ID NO: 29 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 14 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 29 

Asp Thr Thr Gin Thr Ala Pro Val Xaa Asn Asn Val Xaa Pro 

5 10 



(31) INFORMATION FOR SEQ ID NO: 30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 11 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 30 

Asn Gin Leu Asp Ala Xaa Asn Tyr Ala Glu Val 

5 10 
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(32) INFORMATION FOR SEQ ID NO: 31 

(i) SEQUENCE CHARACTERISTICS : 

5 (A) LENGTH 10 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

10 

(iii) HYPOTHETICAL: Yes 

(iv) FRAGMENT TYPE: N-terminal 

15 (v) SEQUENCE DESCRIPTION: SEQ ID NO: 31 

Asn Leu Ser Arg Trp Arg Asn Tyr Ala Glu 

5 10 

20 

(33) INFORMATION FOR SEQ ID NO: 32 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 4 amino- acids 
25 (B) TYPE: Amino acid 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 
30 (iii) HYPOTHETICAL: Yes 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 32 

35 

Trp Gin Gly Lys 



(34) INFORMATION FOR SEQ ID NO: 33 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 11 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

45 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

50 (iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 33 

lie Gin Leu Gly Glu Ser Asn Asp Asp Xaa Xaa 
55 5 10 
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(35) INFORMATION FOR SEQ ID NO: 34 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH 11 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 34 

Gin Val Pro Thr lie Gin Asp Xaa Thr Asn Lys 

5 10 



(36) INFORMATION FOR SEQ ID NO: 35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 6 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal • 

(V) SEQUENCE DESCRIPTION: SEQ ID NO: 35 

lie Tyr Xaa Tyr Val Lys 

5 



(37) INFORMATION FOR SEQ ID NO: 36 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 6 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION: SEQ ID NO: 36 

Asn Gin Leu Gly Asn Tyr 

5 
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(38) INFORMATION FOR SEQ ID NO: 37 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH 4 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Lin ar 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 37 
Val Ala Leu Thr 

(39) INFORMATION FOR SEQ ID NO: 38 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 12 amino acids 
(8) TYPE: Amino acid 
(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 38 

Asp Ala lie Val Val Asn Pro Xaa Asp Ser Val Ala 

5 10 

(40) INFORMATION FOR SEQ ID NO: 39 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 9 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-termihal 

(v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 39 

Thr Phe Thr Asn Tyr Asp Gly Ser Lys 

5 
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(41) INFORMATION FOR SEQ ID NO: 40 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH 10 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 40 

Thr Gly Asn Asp Pro. Ser His lie Ala Lys 

5 10 



20 (42) INFORMATION FOR SEQ ID NO: 41 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 7 amino acids 

(B) TYPE: Amino acid 
25 (C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

30 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 41 

35 lie Tyr Glu Ser Gin Gly Lys 

5 



(43) INFORMATION FOR SEQ ID NO: 42 
40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 12 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

45 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

50 (iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 42 

Ala Glu Gly Ala Thr Gly Gly Leu Val Pro His Lys 
55 5 10 



10 



15 
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(44) INFORMATION FOR SEQ ID NO: 43 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 amino acids 
5 (B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 
10 (iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-tenainal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 43 



15 



40 



45 



Leu Ala Thr Glu Leu Pro Ala Xaa Ser Lys 

5 10 



20 (45) INFORMATION FOR SEQ ID NO: 44 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 8 amino acids 

(B) TYPE: Amino acid 
25 (C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

30 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 44 

35 Ser Leu Leu Asp Ala Gly Ala Lys 

5 



(46) INFORMATION FOR SEQ ID NO: 45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 14 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 



(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

50 (iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 45 

Glu Lys Pro Gin Asp Leu Asp Asp Asp Pro Leu Tyr Leu Thr 
55 5 10 
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(47) INFORMATION FOR SEQ ID NO:46 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH 11 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 46 

Xaa Gin Xaa His Gin Asp Xaa Xaa Asn Leu Thr 

5 10 



(48) INFORMATION FOR SEQ ID NO: 47 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 15 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: - Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 47 

Phe Asn Asp Glu Ser lie lie He Gly Tyr Phe Xaa Xaa Ala Pro 

5 10 15 



(49) INFORMATION FOR SEQ ID NO: 48 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 14 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 48 

Ser Arg Leu Phe Leu Phe Asp Tyr Asp Gly Thr Leu Thr Pro 

5 10 
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(SO) INFORMATION FOR SEQ ID NO: 49 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 11 amin acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 49 

Gin Leu Gly Asn Tyr Gly Phe Tyr Pro Val Tyr 

5 io 



(51) INFORMATION FOR SEQ. ID NO: 50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 11 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal . 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 50 

Phe Leu Val Glu Asn Pro Glu Tyr Val Glu Lys 

5 10 



(52) INFORMATION FOR SEQ ID NO: 51 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 11 amino acids 

(B) TYPES Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 51 

Xaa He Thr Pro His Leu Thr Ala Xaa Ala Ala 

5 10 
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(53) INFORMATION FOR SEQ ID NO: 52 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 52 

Thr Leu Met Glu Asp Tyr Gin Ser Ser Lys 

5 ' 10 



(54) INFORMATION FOR SEQ ID NO: 53 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 15 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V). SEQUENCE DESCRIPTION FOR SEQ ID NO: 53 

lie Leu Glu Gly Leu Thr Gly Ala Asp Phe Val Gly Phe Gin Thr 

5 10 15 



(55) INFORMATION FOR SEQ ID NO: 54 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH 14 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION FOR SEQ ID NOr54 

Gin lie Leu Xaa Pro Thr Leu Xaa Tyr Gin lie Pro Asp Asn 

5 10 
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(56) INFORMATION FOR SEQ ID NO: 55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 7 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 55 

Phe Gly Gly Tyr Ser Asn Lys 

5 



(57) INFORMATION FOR SEQ ID NO: 56 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 23 amino acids 

(B) TYPE; Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 56 

Phe Xaa Thr Glu Asn Ala Glu Asp Gin Asp Xaa Val -Ala Xaa Val 

5 10 15 

lie Gly Xaa Ala lie Xaa Xaa He 

20 



(58) INFORMATION FOR SEQ ID NO: 57 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 18 amino acids 

(B) TYPE: Amino acid 
(c\ TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 
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(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 57 

Xaa Val Gly Thr Val Gly He Pro Thr Asp Glu He Pro Glu Asn 

5 10 15 

He Leu Ala 



(59) INFORMATION FOR SEQ ID NO: 58 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 19 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

15 

(ii) MOLECULAR TYPE: Peptides 

(iii) HYPOTHETICAL: No 

20 (iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 58 

Leu Leu Val His Ser Leu Leu Asn Asn Thr Ser Gin Thr' Ser Leu 
25 5 10 15 

Glu Gly Pro Asn 



30 (60) INFORMATION FOR SEQ ID NO: 59 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 20 amino acids 

(B) TYPE: Amino acid 
35 (C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

40 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 59 

45 Ser Ser Thr Thr Asn Thr Ala Thr Leu Xaa Xaa Leu Val Ser Ser 

5 10 15 

Xaa He Phe Met Glu 

20 
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(61) INFORMATION FOR SEQ ID NO: 60 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 15 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Lin ar 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 60 

Ala Xaa Asn Arg Pro Thr Ser Ala Ala Thr Ser Leu Val Asn 

5 10 



(62) INFORMATION FOR SEQ ID NO: 61 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH « amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY :• Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 61 

Xaa Phe Thr lie lie Xaa 

5 



(63) INFORMATION FOR SEQ ID NO: 62 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 15 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 62 

Asn Leu Thr Ala Asn Ala Thr Thr S r His Thr Pro Thr Ser 

5 io 
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(64 ) INFORMATION FOR SEQ ID NO: 63 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 7 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 63 

Phe Xaa Xaa Tyr Ser Asn Lys 

5 



(65) INFORMATION FOR SEQ ID NO: 64 

(i) SEQUENCE CHARACTERISTiCS : 

'(A) LENGTH 7 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal. 

(v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 64 

Xaa Pro Xaa Ala Phe Asn Xaa 

5 



(66) INFORMATION FOR SEQ ID NO: 65 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 8 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 65 

He Ala Ser Pro He Gin Xaa Glu 

5 
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(67) INFORMATION FOR SEQ ID NO: 66 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 7 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 66 

Gin Arg Pro Leu Leu Ala Lys 

5 



(68) INFORMATION FOR SEQ ID NO: 67 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 12 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 67 

Phe Phe Ser Pro Ser Ser Asn lie Pro Thr Asp Arg 

5 .10 



(69) INFORMATION FOR SEQ ID NO: 68 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 9 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 68 

Ala Leu Ser Asn Asn lie Ser Gin Glu 

5 
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(70) INFORMATION FOR SEQ ID NO: 69 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH 6 amino acids 
5 (B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 
10 (iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 69 

15 

Xaa Xaa Xaa Tyr Thr Pro 

5 



20 (71) INFORMATION FOR SEQ ID NO: 70 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 16 amino acids 

(B) TYPE: Amino acid 
25 (C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

30 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 70 

35 He Ala Ser Pro He Gin Gin Gin Gin Gin Asp Pro -Thr Ala Asn 

5 10 15 

Leu 



40 

(72) INFORMATION FOR SEQ ID NO: 71 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 6 amino acids 
45 (B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 
50' (iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 



Ill 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 71 

Thr Met Leu Lys Pro Arg 

5 



(73) INFORMATION FOR SEQ ID NO: 72 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH 6 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 72 

He He Glu Asp Glu Ala 

5 



(74) INFORMATION FOR SEQ ID NO: 73 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 11 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 73 

He Thr Pro His Leu Thr Ala Ser Ala Ala Lys 

5 10 



(75) INFORMATION FOR SEQ ID NO: 74 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 8 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 
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(v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 74 

Ser Leu Val Ala Pro Ala Pro Glu 

5 



(76) INFORMATION FOR SEQ ID NO: 75 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 12 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 75 

Lys Pro Gin Asp Leu Asp Asp Asp Pro Leu Tyr Leu 

• 5 10 



(77) INFORMATION FOR SEQ ID N0:76 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 6 

(B) TYPE.* Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 76 

Lys Tyr Ala Leu Leu Arg 

5 



(78) INFORMATION FOR SEQ ID NO: 77 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 11 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 
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(v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 77 

Gin Leu Gly Asn Tyr Xaa Phe Tyr Pro Val Tyr 

5 10 



(79) INFORMATION FOR SEQ ID NO: 78 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 8 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO:78 



Ala Phe Glu Asp His Ser Trp Lys 

5 



(80) INFORMATION FOR SEQ ID NO: 79 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 16 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 79 

Ala Gly His Ala lie Val Tyr Gly Asp Ala Thr Ser Thr Tyr Ala 

5 10 15 

Lys 



(81) INFORMATION FOR SEQ ID NO: 80 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 9 amino acids 

(B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 



(ii) MOLECULAR TYPE: Peptide 



10 



25 



30 
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(iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-terminal 

(V) SEQUENCE DESCRIPTION FOR SEQ ID NO: 80 

Glu Arg Leu Pro Gly Ser Tyr Tyr Lys 

5 

(82) INFORMATION FOR SEQ ID NO: 81 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 7 amino acids 
15 (B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 
20 (iii) HYPOTHETICAL: No 

(iv) FRAGMENT TYPE: N-tenninal 

(v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 81 



Thr Leu Met Glu Asp Tyr Gin 

5 



(83) INFORMATION FOR SEQ ID NO: 82 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 1098 amino acids 
35 (B) TYPE: Amino acid 

(C) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Polypeptide 
40 (iii) HYPOTHETICAL: Yes 

(v) SEQUENCE DESCRIPTION FOR SEQ ID NO: 82 

45 Met Ala Leu lie Val Ala Ser Leu Phe Leu Pro Tyr Gin Pro Gin 

5 10 15 

Phe Glu Leu Asp Thr Ser Leu Pro Glu Asn Ser Gin Val Asp Ser 

20 25 30 

Ser Leu Val Asn lie Gin Ala Met Ala Asn Asp Gin Gin Gin Gin 
50 35 40 45 

Arg Ala Leu Ser Asn Asn lie Ser Gin Glu Ser Leu Val Ala Pro 

50 55 60 

Ala Pro Glu Gin Gly Val Pro Pro Ala lie Ser Arg Ser Ala Thr 

65 70 75 

55 Arg Ser Pro Ser Ala Phe Asn Arg Ala Ser Ser Thr Thr Asn Thr 

80 85 90 
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xxc 


Ser 

0 wX 


Ser 

O CSX 


Ser 

wX 


Glu 


Thr 
xxxx 


Car 
OCX 


Cor 

OCX 


1x1 


fx lo 








275 










280 

w w 








£ O W 




AqTI Gill 
nOXX w JL Li 


Cav* 
ocx 


7v en pyn 


7a en 


A en 
nop 


T.OII 
XJCU 


Thr 
XlXx 


xxxx 


nXa 


irro 


Asp 


r*T i^ 
IslU 


CjIU 








290 










295 








300 




Tvr 1 

xyx vax 


Coy 

OCX 


Acsn T.oii 

nop XjCU 


Gl n 




7\ en 


nop 


Ala 
Ala. 


xjyS 


pin 

oxn 


TV mv« 

ASp 


Tyr 


Lys 


30 






305 
www 










JxU 










o ID 




Val Pro 

v ax * x>^ 


TrVS 
XX Jf w 


pho Glv* 

* X&C NJX Y 

320 


Glv 

NJXy 


xyxr 


OCX 


noil 


Xjy o 

^ 4& W 


OCX 


j->y o 


X All 

XJCU 


T ire 

xjy O 


T ire? 

Xiy O 

. -3 O U 




Tyr Ala 


Leu 


Leu Arcr 


Ser 

•J wX 


Ser 

w wX 


Gin 

w XX X 


Gin 

UXU 


XHSU 


Phe 

XT XXC 


Car 

OCX 


A rrf 


XjcU 


XTXVO 








335 










340 










35 


Trp Ser 


lie 


Val Pro 


Ser 


lie 


Lys 


Glv 
v»xy 


Asn 


Glv 
v»xy 


Ala 

nxa 




xjy o 


A en 
noil 








350 










355 

www 








www 




Ala lie 


Asn 


Thr Ala 


Val 


Leu 


Glu 


Asn 


Tie 

xxc 


lie 

X xc 


Pro 

XTX w 


His 

XX xo 


ax y 


Hi e 
Xx Xo 








365 










370 








375 




Val Lvs 

V GLX iJV 0 


Tm 


Val Glv 

v a x w x y 


Thr 

X XXX 


Val 
vox 


Gl v 
wX y 


Tie 
xxc 


n*»#» 
rxu 


XXXX 


ASp 


/Sin 

V71U 


Tlo 

lie 


n«»/K 

XTX O 


40 






380 

wow 










JQ J 












Glu Asn 


Tie 
x xc 


Leu Ala 
395 


Asn 


Tie 

X XC 


Ser 
OCX 




C«y 

OCX 

400 


XJCU 


xiys 


nop 


xjy S 


iyr 

405 




Asp Ser 


Tvr 

x jr x. 


Pro Val 


Leu 

JWWM» 


Thr 

xxxx 


Asn 


Asn 

AOL/ 


no ky 


Thr 
xxxx 


Phe 


T.vq 

xjy o 


Ala 
nla 


Ala 
nla 








410 










415 

X'W 










45 


Tyr Lys 


Asn 


Tvr Cvs 

x j x. wy s* 

425 


Lys 


Gin 

w X4& 


lie 

X XC 


Tjeu 


Trn 
430 


Pro 

XTX W 


Thr 
xxxx 


T.eii 
XjcU 


xlxo 


Tvr 

iyr 

*v w w 




Gin lie 


Pro 


Aso Asn 


Pro 


Asn 


Ser 

w^x 


Lvs 

XJjr o 


Ala 

AXO 


Phe 

XT xxc 


Glu 

UXU 


Afin 
nop 


His 
xxXo 


Cor 
Ocx 








440 










445 








450 

"S W V 




Trp Lys 


Phe 


Tvr Arcr 


Asn 


Leu 


Asn 


Gin 

w XI A 


Arcr 


Phe 


Ala 
Aid 


nop 


Al a 
nla 


Tl o 
XXc 


50 ' 






455 










460 








465 




Va 1 T,v<; 

v ax xjy 9 


Tip 

JL XC 


X y X XJjr o 


XJjr O 




nop 


rpK r 
xxxx 


T 1 o 

XXC 


irp 


116 


nis 


Asp 


Tyr 








470 










475 








480 




His Leii 


Met 


Leu Val 
485 


Pro 

XT X VJ 


Gin 




Val 
v ax 


490 


nop 


Va 1 
Val 


XjcU 


XTX O 


fne 
495 


55 


Ala Lys 


He 


Gly Phe 
500 


Thr 


Leu 


His 


Val 


Ser 
505 


Phe 


Pro 


Ser 


Ser 


Glu 
510 



116 





Val 


Phe 


Arg 


Cys 


Leu 


Ala 


Gin 


Arg 


Glu 


Lys 


11 


Leu 


Glu 


Gly 


Leu 










515 










520 










525 




Thr 


Gly 


Ala 


Asp 


Phe 
530 


Val 


Gly 


Phe 


Gin 


Thr 
535 


Arg 


Glu 


Tyr 


Ala 


Arg 
540 


5 


His 


Phe 


Leu 


Gin 


Thr 
545 


Ser 


Asn 


Arg 


Leu 


Leu 
550 


Met 


Ala 


Asp 


Val 


Val 
555 




His 


Asp 


Glu 


Glu 


Leu 


Lys 


Tyr 


Asn 


Gly 


Arg 


Val 


Val 


Ser 


Val 


Arg 










560 










565 










570 




Phe 


Thr 


Pro 


Val 


Gly 


He 


Asp 


Ala 


Phe 


Asp 


Leu 


Gin 


Ser 


Gin 


Leu 


10 










575 










580 










585 




Lys 


Asp 


Gly 


Ser 


Val 


Met 


Gin 


Trp 


Arg 


Gin 


Leu 


He 


Arg 


Glu 


Arg 










590 










595 










600 




Trp 


Gin 


Gly 


Lys 


Lys 
605 


Leu 


He 


Val 


Cys 


Arg 
610 


Asp 


Gin 


Phe 


Asp 


Arg 
615 


15 


He 


Arg 


Gly 


He 


His 


Lys 


Lys 


Leu 


Leu 


Ala 


Tyr 


Glu 


Lys 


Phe 


Leu 










620 










625 










630 




Val 


GlU 


Asn 


Pro 


GlU 
635 


Tyr 


Val 


Glu 


Lys 


Ser 
640 


Thr 


Leu 


He 


Gin 


He • 
645 




Cys 


He 


Gly 


Ser 


Ser 


Lys 


Asp 


Val 


Glu 


Leu 


Glu 


Arg 


Gin 


He 


Met 


20 








650 










655 










660 




He 


Val 


Val 


Asp 


Arg 


He 


Asn 


Ser 


Leu 


Ser 


Thr 


Asn 


He 


Ser 


He 










665 










670 










675 




Ser 


Gin 


Pro' 


Val 


Val 
680 


Phe 


Leu 


His 


Gin 


Asp 
685 


Leu 


Asp 


Phe. 


Ser 


Gin 
690 


25 


Tyr 


Leu 

- 


Ala 


Leu 


Ser 
695 


Ser 


Glu 


Ala 


Asp 


Leu 
700 


Phe 


Val 


Val 


Ser 


Ser 
705 




Leu. 


Arg 


Glu 


Gly 


Met 
710 


Asn 


Leu 


Thr 


Cys 


His 
715 


Glu 


Phe 


He 


Val 


Cys 
720 ' 




Ser 


Glu 


Asp 


Lys 


Asn 


Ala 


Pro 


Leu 


Leu 


Leu 


Ser 


Glu 


Phe 


Thr 


Gly 


30 










725 










730 










735 




Ser 


Ala 


Ser 


Leu 


Leu 
740 


Asn 


Asp 


Gly 


Ala 


He 
745 


He 


He 


Asn 


Pro 


Trp 
750 




Asp 


Thr 


Lys 


Asn 


Phe 


Ser 


Gin 


Ala 


He 


Leu 


Lys 


Gly 


Leu 


Glu 


Met 










755 










760 










765 


35 


Pro 


Phe 


Asp 


Lys 


Arg 
770 


Arg 


Pro 


Gin 


Trp 


Lys 
775 


Lys 


Leu 


Met 


Lys 


Asp 
780 




He 


He 


Asn 


Asn 


Asp 
785 


Ser 


Thr 


Asn 


Trp 


He 
790 


Lys 


Thr 


Ser 


Leu 


Gin 

795 




Asp 


He 


His 


He 


Ser 


Trp 


Gin 


Phe 


Asn 


Gin 


Glu 


Gly 


Ser 


Lys 


He 


40 








800 










805 










810 




Phe 


Lys 


Leu 


Asn 


Thr 
815 


Lys 


Thr 


Leu 


Met 


GlU 
820 


Asp 


Tyr 


Gin 


Ser 


Ser 
825 




Lys 


Lys 


Arg 


Met 


Phe 


Val 


Phe 


Asn 


He 


Ala 


GlU 


Pro 


Pro 


Ser 


Ser 










830 










835 










840 


45 


Arg 


Met 


He 


Ser 


He 
845 


Leu 


Asn 


Asp 


Met 


Thr 
850 


Ser 


Lys 


Gly 


Asn 


He 
855 




Val 


Tyr 


He 


Met 


Asn 


Ser 


Phe 


Pro 


Lys 


Pro 


He 


Leu 


Glu 


Asn 


Leu 










860 










865 










870 




Tyr 


Ser 


Arg 


Val 


Gin 


Asn 


He 


Gly 


Leu 


He 


Ala 


Glu 


Asn 


Gly 


Ala 


50 










875 










880 










885 




Tyr 


Val 


Ser 


Leu 


Asn 
890 


Gly 


Val 


Trp 


Tyr 


Asn 
895 


He 


Val 


Asp 


Gin 


Val 
900 




Asp 


Trp 


Arg 


Asn 


Asp 
905 


Val 


Ala 


Lys 


He 


Leu 
910 


Glu 


Asp 


Lys 


Val 


Glu 
915 


55 


Arg 


Leu 


Pro 


Gly 


Ser 
920 


Tyr 


Tyr 


Lys 


He 


Asn 
925 


Glu 


Ser 


Met 


He 


Lys 
930 
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Phe 


His 


Thr 


Glu 


JVcn Ala Gin A en 
noil Ala uxu nop 










935 


lie 


GlV 


Asp Ala 


He Thr His Tl 










950 


Gly 


lie 


His 


Ala 


Tvr Val Tvr Lvs 










965 


Val Gly 


Leu 


Ser 


Leu Ser Ala Ala 










980 


Asn 


Ser 


Ala 


Ser 


Asp Pro Leu Asp 










995 


Asn 


lie 


Gin 


Thr 


Pro Ser Gin Gin 










1010 


Gin 


Pro 


Pro 


Ala 


Ser Pro Thr Val 










1025 


Ala 


Cys 


Val 


Ser 


Glv Set" Cor* .Car 

>* J-jr OwJL OCJL 0C?X 










1040 


Lys 


Leu 


Val 


Asn 


Asn" Glu Ala Ser 










1055 


His 


Ala 


He 


Val 


Tvr Glv Asn Ala 

*jr A> nxu 










1070 


His 


Val 


Asn Gly 


T.eu Asn Glu T.on 










1085 


lie 


Glu 


Asp 










1098 





Gin Asp Arg Val Ala Ser 


Val 




940 


945 


noil 


J.X1X7 vai rne Asp His 


Arg 






you 


Asn 

nail 


Val Val Rpt Val Gin 
vox vox otsx; val VvXIi 


Gl r> 




970 


07c 


Gin 


Phe Leu Ph Arcr PHp 

* AJwM, X^IX Ai. y XT lies 


Tvr 




985 




Thr 


Ser fior Glv Gin Tlo 
wci uiy urJ.Il JLlc 


xiir 




1UUU 


1 nn ft 

1UUD 


nSIl 


rio ser Asp Gin Glu 


Gin 




i m r 


^ ft *% f\ 
102 0 


Ser 


Met Asn His lie Asp 


Fne 




1030 




Pro 


Val Leu Glu Pro Leu 


Fne 




1045 


1UDU 


Glu Gly Gin Val Lys Ala 


Giy 




1060 


IUOD 


Thr 


Ser Thr Tyr Ala Lys 


ulU 




1075 


1080 


Phe 


Thr lie lie Ser Arg 


lie 




1090 


1095 



(84) INFORMATION FOR SEQ ID NO: 83 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 5981 basepairs 

(B) TYPE: Nucleotide 

(C) STRANDEDNESS : Doublestranded 
• (D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Genomic DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomvces cerevisiae 

(B) STRAIN: S288C 

(C) HAPLOTYPE: Haploid 

(Vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Genomic 

(B) CLONES: 6 and 10 

(viii) POSITION IN GENOME 

(A) CHROMOSOME: 13 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 
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GGCTCACATT 
GTGGTGATAA 
CAAAATGGGG 
ACGCATCACG 
5 CGTACTGGCG 
AAGAACCAAA 
GCGGAGACTA 
ACGGCTGCGT 
TAGCGTGGTC 

10 TCGATACGAC 
GGATCATGGT 
TTATTCTCTC 
CTTTCATCTG 
TGCCAAGGCA 

15 TTTTCACAAC 
CTTGTGGTGG 
GATGAAGTCG 
CAGCGCTGGT 
AGAAACCAGC 

20 CAGAAGAGTG 
GACTAACGAA 
TTGTGGGAAA 
GATGGCTCGA 
ATCGATCGAC 

25 TGCATGCAGA 
CAGCGCGAGG 
CTGCAATAGT 
ATTTGTGATT 
GTTTGCCAGC 

30 CTGAGATCTT 
CACGATTTCA 
GTCTGTTTAA 
GGAGAAGGGT 
GGTCCGCAAG 

35 ACAAGCACAC 
TACTTTTGTG 
ATAAATGTGT 
TATTGCGAAA 
TAAAAGGGGC 

40 CCCCGCCCCT 
TGTCTAAGTC 
AGGGGCGGGG 
GAAGGGGGTA 
AGCCAAGCTT 

45 CGTGTCATTG 
AAGAATACAA 
CTTTGTTTTT 
GAGAACTCGC 
TGACCAACAG 

50 TGGTCGCGCC 
GCCACCAGGT 
TGCCACTTTA 
CTGCGAATGC 
CCCCGGAAAA 

55 TCCCACGGAT 
CGAGAATTGC 



CCAAAAAGAA 
CAAGGTTAGG 
GGGTGCGTAA 
AACAGGCCAT 
TCTGCGGGCG 
TACACAGCAA 
GTGAAGCAGA 
GGTGGTAGCG 
ACCTGATAAC 
TGTTCGATGT 
CACTACGTCC 
ACAGTCTGCG 
CAGGAGTTAG 
GAACTGCCTT 
GAGACGCTAC 
TTTGGTCGTA 
CGAACTGCGT 
GGCGTTAAAA 
GCTGATGGCG 
TGCTTAAGCT 
GTACTCGTGT 
TATACATTAC 
CCCTACTAAT 
ACAGAAACGC 
GCCACTAGAC 
CAGGTGGGAT 
AGCGATAGTA 
TTTAATATAT 
GAAGATATTT 
GCACGACTTT 
ACC&TGTCAC 
CCGAGCAACG 
CGTTGGTGGC 
GGTACTTGCT 
CATCGTCAGT 
CGCGTGGGTA 
CCCGAGTTAT 
TTTCTGCTCT 
GAAACAGAGG 
GCATTTTGAT 
CGGCTGGCGG 
GTCCGGGCAG 
GAACCAAGGG 
AAATATAAAT 
CACATCCACC 
TAGCAACGCA 
GCCCTACCAA 
AGGTGGACTC 
CAACAACGTG 
AGCACCAGAA 
CACCCAGTGC 
GATGATCTTG 
AACTACCTCA 
ATGGTTCCGT 
CGCATCGCAT 
TTCGCCAATC 



CAGTTCGAAC 
ATATGGAAGT 
GATTGAAAGC 
AAATGTAATC 
ATGACGGCCA 
GAATCTGTGG 
CGAGAACAAG 
GTGCTACTGC 
AGGAACATAG 
TGGAGCTGGG 
AAGGTCTTGC 
GACCGGTCTC 
TTACAGGGCT 
GTCCAGGTGA 
CTTCATTCTT 
ATTCCCAGTG 
ATACGTGTAT 
ACCGGCCTGC 
GCTTTCTCGC 
GCCCTATAAG 
ACGACACGGA 
TCACCCATAA 
CTCATCAACA 
AATTCGGTTC 
ACTGACGAGA 
CGTGAACATG 
AAAAGAGGCG 
TTAATGCGGT 
TCCATTGTGT 
TTCCTGAGGC 
TATTAACGGC 
CGTTCCTeCG 
GACCGAGGGC 
TGGCGCCCCT 
ATAAAGCACT 
GCCGTGCCTC 
TATTCTAAAG 
TCTCGTCTCG 
GCCAGATAGA 
ATGGCGTATT 
GCCTGGCGCC 
CGGCTATGGT 
GCTAGCACAC 
ACTAATGTAA 
CACCCGTCGA 
AGATCAACAC 
CCACAATTCG 
ATCTCTCGTG 
CGCTTTCTAA 
CAAGGTGTCC 
TTTCAACCGC 
TCTCTTCGGA 
CATACGCCAA 
GGAACGATTC 
CGCCAATCCA 
CAACAGCAAC 



GATAAAGCTT 
TGAACAGAGA 
CTTGACTTTC 
CGATTCAACT 
AGTGCTGCTA 
TCAGACCATT 
GAGAAATGGG 
GGCGGCAGAG 
TGGTGGCATG 
ATGCTGGTAT 
ATGGGACCCA 
IGCATGTATA 
TGAAACTTTA 
TGTCCTGAGG 
TAGGCGATGC 
GTGTGTATAA 
ACTAGATCTG 
GATTAGAATC 
CCGTATTTTA 
CTAGTATTTG 
TGTATTGGAG 
CTGATTTAGC 
GACGGATTCT 
AAGGATAGAG 
GCGCGGTAGC 
CTGCCGGTGA 
CATACATCCT 
ACATAAGAAT 
GCTCAATGGA 
ACATGTGGCG 
TCTCTTTCGA 
GAGCCGATGG 
GCCGGTTTGG 
GTGTTTCACG 
ATAGTCGAAC 
GTCTGTGTGT 
CGGGCACCAT 
CTCAAAAATC 
AATTTCGAGA 
TGGGATTGCT 
CTCGCCGAAG 
GTCAGTTACC 
TCACCCTGGG 
CTAIAAATAT 
TTAAAAAACC 
AATGGCTCTC 
AGCTTGACAC 
AACATCCAGG 
CAACATCTCA 
CCCCAGCAAT 
GCCTCGTCTA 
CATATTCATG 
CAAGCAAGAC 
TTCTCCCCTT 
GCATGAGCAT 
AGCAGGACCC 



TTCACCGCTG 50 
TGAAAACGGA 100 
TTGGCTCGTG 150 
CGAAGGGTGA 200 
TGGAAGCAAG 250 
CGGAATGGAT 300 
TTGTGTGGAA 350 
ATTTACGATC 400 
TATGGACAAT 450 
GCGGCCAGTC 500 
TTAAATCAGT 550 
TGGAGTCATT 600 
GAAGTAAGGT 650 
ACAAATTACT 700 
AGCATATCGC 750 
GGTGGCTGGT 800 
GAAT ACTGA A 850 
CCATCTTTGA 900 
CGAAACGTGC 950 
CCATAGCAAC 1000 
CCGTTATGCG 1050 
ATGGTCTGA6 1100 
GTTCGTATGT 1150 
CCGCCAGCGA 1200 
GGCTAAGAAC 1250 
AGAAGATCCC 1300 
ACGCCAGTCG 1350 
GCCTAATCTA 1400 
CCCTGTGTTC 1450 
CCATCGTCAT 1500 
TCACTACCAT 1550 
TACTGGCTCC 1600 
CATCCTGTAC 1650 
GTGTAAACAA 1700 
CATCCATTTT 1750 
GTGGGAATGT 1800 
TGTAGTAACT 1850 
GCGTTCAGGG 1900 
AAACGGGTCA 1950 
TGCTCGAAAG 2000 
GGAGATAGGA 2050 
TAGGGAAGGA 2100 
GCCCCCGTCT 2150 
AAGGATCTAC 2200 
AAACAAAGCA 2250 
ATCGTGGCAT 2300 
CTCTCTCCCT 2350 
CTATGGCCAA 2400 
CAGGAATCAT 2450 
CTCAAGGAGT 2500 
CGACAAATAC 2550 
GAAAACTTGA 2600 
TATGCTTAAA 2650 
CTTCCAATAT 2700 
GACTCCGGTT 2750 
CACGACCAAC 2800 



119 



TTATTAAAGA ACGTCAACAA GTCATTGTTA GTGCACTCAC TGTTGAACAA 2850 
CACCTCACAA ACTAGCCTAG AAGGACCCAA CAACCACATT GTTACCCCGA 2900 
AATCGAGGGC 66GCAACAGG CCTACTTCGG CGGCTACTTC TTTAGTTAAT 2950 
AGGACCAAAC AAGGTTCGGC CTCCTCTGGA TCTTCTGGGT CTTCTGCGCC 3000 
ACCTTCCATT AAAAGGATTA CGCCCCACTT GACTGCGTCT GCTGCAAAAC 3050 
AGCGTCCCTT ATTGGCTAAA CAGCCTTCTA ATCTGAAATA TTCGGAGTTA 3100 
GCAGATATTT CGTCGAGTGA GACGTCTTCG CAGCATAATG AGTCGGACCC 3150 
GGATGATCTA ACTACTGCCC CTGACGAGGA ATATGTTTCT GATTTGGAAA 3200 
TGGATGACGC GAAGCAGGAC TACAAGGTTC CAAAGTTCGG CGGCTATTCC 3250 
AATAAATCTA AACTTAAGAA ATATGCGCTG TTAAGGTCAT CTCAGGAGCT 3300 
GTTTAGCCGT CTTCCATGGT CGATCGTTCC CTCTATCAAA GGTAATGGCG 3350 
CCATGAAGAA CGCCATAAAC ACTGCAGTCT TGGAGAATAT CATTCCGCAC 3400 
CGTCATGTTA AGTGGGTCGG TACCGTCGGA ATCCCAACGG ATGAGATTCC 3450 
GGAAAATATC CTTGCGAACA TCTCTGACTC TTTAAAAGAC AAGTACGACT 3500 
CCTATCCTGT CCTTACGGAC GACGACACCT TCAAAGCCGC ATACAAAAAC 3550 
TACTGTAAAC AAATCTTGTG GCCTACGCTG CATTACCAGA TTCCAGACAA 3600 
TCCGAACTCG AAGGCTTTTG AAGATCACTC TTGGAAGTTC TATAGAAACT 3650 
TAAACCAAAG GTTTGCGGAC GCGATCGTTA AAATCTATAA GAAAGGTGAC 3700 
ACCATCTGGA TTCATGATTA CCATTTAATG CTGGTTCCGC AGATGGTGAG 3750 
AGACGTCTTG CCTTTTGCCA AAATAGGATT TACCTTACAT GTCTCGTTCC 3800 
CCAGTAGTGA AGTGTTTAGG TGTCTGGCTC AGCGTGAGAA GATCTTAGAA 3850 
GGCTTGACCG GTGCAGACTT TGTCGGCTTC CAGACGAGGG AGTATGCAAG 3900 
ACATTTCTTA CAGACGTCTA ACCGTCTGCT AATGGCGGAC GTGGTACATG 3950 
ATGAAGAGCT AAAGTATAAC GGCAGAGTCG TTTCTGTGAG GTTCACCCCA 4Q0O 
GTTGGTATCG ACGCCTTTGA TTTGCAATCG CAATTGAAGG ATGGAAGTGT 4050 
CATGCAATGG CGTCAATTGA TTCGTGAAAG ATGGCAAGGG AAAAAACTAA 4100 
TTGTGTGTCG TGATCAATTC GATAGAATTA GAGGTATTCA CAAGAAATTG 4150 
TTGGCTTATG AAAAATTCTT GGTCGAAAAT CCGGAATACG TGGAAAAATC 4200 
GACTTTAATT CAAATCTGTA TTGGAAGCAG TAAGGATGTA GAACTGGAGC 4250 
GCCAGATCAT GATTGTCGTG GATAGAATCA ACTCGCTATC CACCAATATT 4300 
AGTATTTCTC AACCTGTGGT ' GTTTTTGCAT CAAGATCTAG ATTTTTCTCA 4350 
GTATTTAGCT TTGAGTTCAG AGGCAGATTT GTTCGTAGTC AGCTCTCTAA 4400 
GGGAAGGTAT GAACTTGACA TGTCACGAAT TTATCGTTTG TTCTGAGGAC 4450 
AAAAATGCTC CCCTACTGTT GTCAGAATTT ACTGGTAGTG CATCTTTATT 4500 
GAATGATGGC GCTATAATAA TTAACCCATG GGATACCAAG AACTTCTCAC 4550 
AAGCCATTCT CAAGGGGTTG GAGATGCCAT TCGATAAGAG AAGGCCACAG 4600 
TGGAAGAAAT TGATGAAAGA CATTATCAAC AACGACTCTA CAAACTGGAT 4650 
CAAGACTTCT TTACAAGATA TTCATATTTC GTGGCAATTC AATCAAGAAG 4700 
GTTCCAAGAT CTTCAAATTG AATACAAAAA CACTGATGGA AGATTACCAG 4750 
TCATCTAAAA AGCGTATGTT TGTTTTCAAC ATTGCTGAAC CACCTTCATC 4800 
GAGAATGATT TCCATACTGA ATGACATGAC TTCTAAGGGC AATATCGTTT 4850 
ACATCATGAA CTCATTTCCA AAGCCCATTC TGGAAAATCT TTACAGTCGT 4900 
GTGCAAAACA TTGGGTTGAT TGCCGAGAAT GGTGCATACG TTAGTCTGAA 4950 
CGGTGTATGG TACAACATTG TTGATCAAGT CGATTGGCGT AACGATGTAG 5000 
CCAAAATTCT CGAGGACAAA GTGGAGAGAT TACCTGGCTC GTACTACAAG 5050 
ATAAATGAGT CCATGATCAA GTTCCACACT GAAAATGCGG AAGATCAAGA 5100 
TCGTGTAGCT AGTGTTATCG GTGATGCCAT CACACATATC AATACTGTTT 5150 
TTGACCACAG AGGTATTCAT GCCTACGTTT ACAAAAACGT TGTTTCCGTA 5200 
CAACAAGTGG GACTTTCCTT ATCGGCAGCT CAATTTCTTT TCAGATTCTA 5250 
TAATTCTGCT TCGGATCCAC TGGATACGAG TTCCGGCCAA ATCACAAATA 5300 
TTCAGACACC ATCTCAACAA AATCCTTCAG ATCAAGAACA ACAACCTCCA 5350 
GCCTCTCCCA CTGTGTCGAT GAACCATATT GATTTCGCAT GTGTCTCTGG 5400 
TTCATCGTCT CCTGTGCTTG AACCATTGTT CAAATTGGTC AATGATGAAG 5450 
CAAGTGAAGG GCAAGTAAAA GCCGGACACG CCATTGTTTA TGGTGATGCT 5500 
ACTTCTACTT ATGCCAAAGA ACATGTAAAT GGGTTAAACG AACTTTTCAC 5550 
GATCATTTCA AGAATCATTG AAGATTAAAT TTTACCATTT TAAAATTTTA 5600 
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ATGTTCTTGG GTATGAACTT TTATTTTCAA CTGCTTATTA TATATCAATT 5650 
CTATAAATTT TTTTCTTCTC TCTAACGACC AATTATAAAA TTCATCCTCT 5700 
TATTTATTAC AGCATCTTAT ACATTATGTA TATGGGTAGC TATTATTCAT 5750 
TTTTGCTTCG TAAGGACTTT TTTTGTCAAC TTTTTCATCC TAAGCGGCTA 5800 
5 AAAGTGATTG GAGAGGAATG TCCAGGCGAC CAATGATAAA AACGCTTTCT 5850 
CTTGGAACAA GAAATAGGAG CAATTGACAG TTGTCGATGA ACAGCGAAAA 5900 
TAGTAAGATA ACCTTCAAGC CCAATATTCT AATTAAAGGC GTTTATATAT 5950 
TTGTACTTTA TGGTATGTGC ATATGTATTG T 5981 

10 

(85) INFORMATION FOR SEQ ID NO: 84 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH50 base pairs 
15 (B) TYPE: Nucleotide 

(C) STRANDEONESS : Ooublestranded 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Synthetic DNA 

20 

{iii) HYPOTHETICAL: No 
(iv) ANTISENSE: No 
25. (xi) SEQUENCE DESCRIPTION: SEQ ID NO:84 



CGGGAAGACA TAGAACTATG ACTACGGATA ACGCTAAGGC GCAACTGACC 50 

30 

(86) INFORMATION FOR SEQ ID NO: 85 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH48 basepairs 
35 (B) TYPE: Nucleotide 

(C) STRAND EDNESS : Doublestranded 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Synthetic DNA 

40 

(iii) HYPOTHETICAL: No 

(iv) ANTISENSE: No 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:85 



GGGCCCAACA ACACAATGGT TACCCCGAAA TCGAGGGCGG GCAACAGG 48 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRule 13bis) 



A. The indications made beiow relate to the microorganism referred to in the description 

on page 25 ,line 16 , 

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet j^] 

Name of depositary institution 

DSM-DEDTSCHE SAMMLUNG VON MIKKDQRGRNISMEW UND ZELLKDIOTEEN QtihH ' 

Address of depositary institution {including postal code and country) 

Mascheroder Weg 1 B, D-3300 Braunschvasig, Federal Republic of Germany 



Date of deposit 


Accession Number 




18 February 1992 


DSM 6928 





C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet Q 



In respect of those designations in which a European patent is sought, 
a saitple of the deposited znicroorganism will be made available only by 
the issue of such a sample to an expert nominated by the person requesting 
the sanple (Rule 28(4) EPC) until the publication of the mention of the 
grant of the European patent or until the date on vghich the application 
has been refused or is deemed to be withdrawn. 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications arena for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed bdow will be submitted to the international Bureau \ater^p^fyih^g^»Mimniu^^tM^ s^i^ f trnTr s 'Accession 
Number of Deposit") 



For receiving Office use only 



{^] This sheet was received with the international application 



AuthonzEdrsfQcer 
Form PCT.T10/134 (July 1992) 



For international Bureau use only 



[""1 This sheet was received by the international Bureau 



on: 



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRuIe I3bis) 



A. The indications made below relate to use microorganism referred 10 tn the description 
on page 25 Jine 28 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet f~| 



Name of depositary institution 

DSM-DEOTSCHE SSMMEDNG VCN MIKROORGRNISMEN DND ZKTiTiKDCTUREN CSiSbH 



Address of depositary institution (including postal code and country} 

Mascheroder Weg 1 B, D-3300 Braunschweig, Federal Republic of Germany 



Date of deposit 

25 January 1993 


Accession Number 

DStt 7425 - 


C ADDITIONAL INDICATIONS (Uw*blankiin*appikabU\ This information is continued on an additional sheet Q 



In respect of those designations in which a European patent is sought, 
a sample of the deposited microorganism will be made available only by 
the issue of such a sample to an expert nominated by the person requesting 
the sanple (Bole 28 (4) EPC) until the publication of the mention of the 
grant of the European patent or until the date on which the application 
has been refused or is deemed to be withdrawn. 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE ft fine indications are not for aii denpuud States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed beiow will be suomitted to toe international Bureau uietispccminegaurat nature of inc 
Numoer of Deposit 9 ) 



indications cp. 'Accession 



For receiving Office use only 



Ibis sneet was received with tb international application; 




Form PCTRO/lll rMv locr?> 



For international Bureau use only 



P"| Ibis sheet was received by the international Bureau on: 



Authonzed ofiucer 
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1. Genes, which comprise at 1 ast one of th isolated and 
pur if i d DNA s qu nces coding for the short chain of about 57 
kDa and th long chains of about 99 JcDa and about 123 kDa of 
trehalose synthase and functional equivalents thereof. 

2. The genes of claim 1, wherein the isolated and purified DNA 
sequence encodes the amino acid sequences of the short chain of 
about 57 kDa having the SEQ ID NO: 2 or functional equivalents 
thereof . 

3. The genes of claim 1 wherein the isolated and purified DNA 
sequence encodes the amino acid sequences of the long chain of 
about 123 kDa comprising the SEQ ID NO: 4 or functional 
equivalents thereof. 

4. The genes of claim 1 wherein the isolated and purified DNA 
sequence encodes the amino acid sequences of the long chain of 
about 123 kDa comprising the SEQ ID NO: 82 or functional 
equivalents thereof. 

5. The genes of claim l wherein the isolated and purified DNA 
sequence encodes the amino acid sequences of the long chain of 
about 99 kDa having the SEQ ID NO: 29 - 38 and 44 - 49 or 
functional equivalents thereof. 

6. The genes of claim l, which are selected from the group 
consisting of TSS1, TSL1 and TSL2 and functional equivalents 
thereof . 

7. The genes of claim 6, wherein TSS1 comprises the open 
reading frame of SEQ ID NO: 1 or functional equivalents 
thereof. 

8. The genes of claim 6, wherein TSLl comprises the open 
reading frame of SEQ ID NO: 3 or functional equivalents thereof. 
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9. The genes of claim 6, wherein TSL1 comprises the open 
reading fram SEQ ID NO: 83 or functional quival nts thereof. 

10. The genes of claim 6, wherein TSL2 comprises the DNA 
5 sequences inserted into pALK756 (DSM7425) or functional 

equivalents thereof. 

11. A truncated TSL1 gene encoding a truncated form of the 123 
kDa long chain of trehalose synthase lacking up to 600 amino 

10 acids from one end preferably the N-terminus. 

12. The truncated TSL1 gene of claim 11, wherein the long 
chain lacks up to 330 amino acids from the N-terminus end. 

15 13. The truncated TSL1 gene of claim 11, wherein the long 
chain lacks 250 - 450 amino acids from the N- terminus end. 

14. The truncated TSL1 gene of claim 11, wherein the long 
chain lacks 250 - 450, preferably about 325 amino acids from 

20 the N-terminus end. 

* 

15. The truncated TSL1 gene of claim 11, wherein the long 
chain lacks 209 amino acids from the N- terminus end. 

25 16. A vector which comprises at least one of the genes of 
claims 1-15. 

17. A DNA construct which comprises at least one of the genes 
of claims 1-15. 

30 

18. Host cells or organisms transformed with at least one of 
the vectors of claim 16. 

19. Host cells or organisms transformed with at least one of 
35 DNA constructs of claim 17. 

20. Host cells or organisms transformed with the truncated 
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TSL1 gene of claims 10-15. 



21. The transformed host cells or organisms of claims 18 or 
19, which express a trehalose synthase which xhibits 

5 tr halose-6-phosphate synthase activity activatable by 

fructose-6-phosphate and also trehalose-6-phosphatase activity. 

22. The transformed host cells or organisms of claim 20, which 
express a trehalose synthase that is less inhibited by 

10 phosphate than is the intact trehalose synthase. 

23. The transformed host cells or organisms of any of claims 
18 to 22, which are selected from a group consisting of plants, 
fungi, yeasts or bacteria. 

15 

24. The transformed host cells or organisms of claims 23, 
wherein the yeast is Saccharomyces cerevisiae . 

25. The transformed host cells or organisms of any of claims 18 
20 to 22, wherein parts of the organism have increased trehalose 

content as compared to the corresponding parts of the parent 
cell or organism when grown under the same conditions. 

26* The transformed host cells or organisms of any of claims 18 
25 to 22, wherein said cells or organisms produce a higher yield 
of ethanol from carbohydrate than the parent cell or organism* 

27. The transformed host cells or organisms of any of claims 
claims 18 to 22, wherein said cells or organisms are more 

30 resistant to heat, cold and water deprivation than are the 
parent cell or organism. 

28. A trehalose synthase produced by recombinant DNA techniques 
using at least one of the genes of claims 1 to 15. 

35 

29. A substantially purified trehalose synthase, which exhibits 
trehalose-6-phosphate synthase activity activatable by 
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fructose-6-phosphate and also trehalose-6-phosphatas activity. 

30. A substantially purified trehalose synthase f which exhibits 
trehalose-synthase activity less sensitive to inhibition by 
phosphate. 

31. The trehalose synthase of claims 28 to 30 , which . comprises 
one short chain of about 57 kDa and at least one of two long 
chains of about 99 IcDa and 123 kDa or truncated forms of the 
123 kDa chain. 

32. The trehalose synthase of claim 31, wherein the short 
chain comprises the amino acid sequence of SEQ ID NO: 2 or 
functional equivalents thereof. 

33. The trehalose synthase of claim 31, wherein the 123 kDa 
long chain comprises the amino acid sequence of SEQ ID NO: 4 or 
functional equivalents thereof. 

34. The trehalose synthase of claim 31, wherein the 123 kDa 
long chain comprises the amino acid sequence of SEQ ID NO: 82 
or functional equivalents thereof. 

35. The trehalose synthase of claim 31 , wherein the 
chain comprises a majority, preferably at least ten 
amino acid sequences SEQ ID N0:s 29 to 38 and 44 to 
functional equivalents thereof. 

3 6. The trehalose synthase of claim 31, wherein the 99 kDa long 
chain comprises the amino acid sequences encoded by the DNA 
sequence inserted into pALK756 (DSM7425) or functional 
equivalents thereof. 

37. A trehalose-6-phosphate synthase, which comprises the 57 
kDa polypeptide corresponding to the short chain of claim 31 
and having the amino acid sequence of SEQ ID. NO: 2 or 
functional equivalents thereof. 



99 kDa long 
of the 
49 or 
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38. A trehalose-6-phosphate phosphatas , which comprises an 
about 99 kDa polypeptide corresponding to the long chain of 
claims 33 - 34. 

5 39. A proc ss for producing ethanol by using the host cells or 
organisms of claims 22 or 23, wherein the yield of ethanol or 
its rate of production is greater than by using the 
untrans formed cell or organism. 

10 40. The process of claim 39, wherein the organism is 
distiller's yeast. 

41. The process of claim 39, wherein at least one of the DNA 
sequences of claims 1-15 are functionally combined with 

15 promoters active under fermentative conditions. 

42. A process for producing a crop plant which has increased 
resistance to water deprivation, heat and cold, comprising " 
transforming the plant by introducing at least one of the genes 

20 of claims 1-15 into said plant's tissue. 

43. A process for producing trehalose by cultivating a host or 
organism which has been transformed with at least one of the 
genes of claims 1-15. 

25 

44. The process according to claim 43, wherein the transformed 
host cell or organism is a fungus, yeast or. plant. 

45* The process according to claim 44, wherein the yeast is 
30 Saccharomvces cerevisiae . 

46. A process for producing trehalose enriched food products 
from plants by introducing at least one of the genes of claims 
1-15 and allowing said genes to express the trehalose 

35 synthase in the edible tissues of the plant. 

47. The process of claim 46, wherein the trehalose is purified 
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by conventional methods. 



48. The process of claim 46 , wherein the trehalose is used as 
such or as an unpurifi d homogenate. 

49. A method for selecting strains which have been transformed 
with at least one of the genes of claims 1-15, wherein the 
selection medium comprises galactose as the main carbon source. 
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SUBSTITUTE SHEET 
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Promoter 

TTGTTGCGAT TGTfCTGTTC CATCTGCACC AGAACAAAGA ACAAAAGAAC -396 
AAGGAACAAA GTCCAAGCAC GTCAGCGCTG TTTATAAGGG GATTGACGAG -346 
GGATCGGGCC TAGAGTGCCA GCGCGCCAGG GAGAGGGAGC CCCCTGGGCC -296 
CTCATCCGCA GGCTGATAGG GGTC ACCCCG C TGGGCAGGT CAGGGCAGGG -246 
GCTCTCAGGG GGGCGCCATG GACAAACTGC ACTGAGGTTC TAAGACACAT -196 
GTATTATTGT GAGTATG TAT ATATAG AGAG AGATTAAGGC GTACAGCCGG -146 
GTGGTAGAGA TTGATTAACT TGGTAGTCTT ATCTTGTCAA TTGAGTTTCT -96 
GTCAGTTTCT TCTTGAACAA GCACGCAGCT AAGTAAGCAA CAAAGCAGGC -46 
TAACAAACTA GGTACTCACA TACAGACTTA TTAAGACATA GAACT ATG +3 

Terminatpr 

ACC AAA AAC TGA TGA ACCCGATGCA AATGAGACGA TCGTCTATTC +1521 
CTGGTCCGGT TTTCTCTGCC CTCTCTTCTA TTCACTTTTT TTATACTTTA +1571 
TATAAAATTA TATAAATGAC ATAACTGAAA CGCCACACGT CCTCTCCTAT +1621 
TCGTTAACGC CTGTCTGTAG CGCTGTTACT GAAGCTGCGC AAGTAGTTTT +1671 
TTCACCGTAT AGGCC +1686 
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40 50 60 
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100 110 120 
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130 140 150 

INFDENAWFGYNEANQTFTNEIAKTMNHND 

160 170 180 

LIWVHDYHLMLVPEMLRVKIHEKQLQNVKV 

190 200 210 

GWFLHTPFPSSEIYR I L P V R O E ILKGVLSC 

220 230 240 

DLVGFHT-YDYAR HFLSSVOR VLNVNT-LPNG 

250 260 270 

V E Y O GRFVNVGAFPIGI DVDKFTDGLKKES 

280 290 300 

VQKRIQQLK E T F K GCKIIVGVDR L D Y I K G V 

310 320 330 

P Q K LHAMEVFLNEHPE WRGKVVLVQVAVPS 

340 350 360 

R G DVEEYOYLR SVVNELVGR INGQFGTVEF 

370 380 . 390 

VPIHFMHKSIPFEELISLYAVSDVCLVSST 

400 410 420 
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430 440 450 
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460 470 480 

V K K E V N W E K L Y K Y I S K YTSAFWGENFVHF. T. 
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Promoter 

TGTAGTAACT TATTG CGAAA TTTC TGCTCT TCTCGTCTCG CTCAAAAATC -392 
GCGTTCA GGG TAAAAGGGGC GAAA CAGAGG GCCAGAT AGA AATTTC GAGA -342 
AAACGGGTCA CCCCGCCCCT UCATTTTGAT ATGGCGTATT TGGGATTGCT -292 
TGCTCGAAAG TGTCTAAGTC CGGCTGGCGG GCCTGGCGCC CTCGCCGAAG -242 
GGAGATAGG A AGGGG CGGGG GTCCGGGCAG CGGCTATGGT GTCAGTTACC -192 
TAGGGAAGGA GAAGGGG GTA GAAC CAAGGG G CTAGCACAC TCACCCTGGG -142 
GCCCCCGTCT AGCCAAGCTT AA ATATAAAT ACTAATGTAA CTATAAA TAT -92 
AAGGATCTAC CGTGTCATTG CACATCCACC CACCCGTCGA TTAAAAAACC -42 
AAACAAAGCA AAGAATACAA TAGCAACGCA AGATCAACAC A ATG GCT +6 

Terminator 

TTC ACG ATC ATT TCA AGA ATC ATT GAA GAT JAA ATTTTACCAT +3307 
TTTAAAATTT TAATGTTCTT GGGTATGAAC TTTTATTTTC AACTGCTTAT +3357 
TATATAT CAA TTCTATAAAT TTTTTTCTTC TCTCTAACGA CCAATTATAA +3407 
AATTCATCCT CTTATTTATT ACAGCATCTT ATACATTATG TATATGGGTA +3457 
GCTATTATTC ATTTTTGCTT CGTAAGGACT TTTTTTGTCA ACTTTTTCAT +3507 
CCTAAGCGGC TAAAAGTGAT TGGAGAGGAA TGtccaggcg accaatgata +3557 
aaaacgcttt ctcttggaac aagaaatagg agcaattgac agttgtcgat +3607 
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